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® MULTIPLE REINFORCEMENT EFFECTS IN 
SHORT-TERM MEMORY 


By J. W. BRELSFORD, JR.,* R. M. SHIFFRIN and R. C. ATKINSON 
Stanford University, Stanford, California 


A continuous memorizing situation was studied in which test and study 
trials alternated throughout an experimental session. The items studied were 
paired-associates. The interval between study and test for a particular item was 
randomly determined; and an item was given one, two, three or four reinforce- 
ments. A quantitative model is proposed which has two memory stores: a 
short-term store in which the subject generates a carefully controlled rehearsal 
scheme of fixed length, and a long-term store in which information is accumulated 
and lost. A large number of theoretical predictions of the model were verified 
quantitatively by the data, which confirm results of previous experiments, and 
support the hypothesis that highly structured rehearsal schemes play a major role 
in many short-term memory and learning situations. 


1. INTRODUCTION 


Atkinson & Shiffrin (1968) have proposed a multi-process model of memory 
that has been employed successfully to describe the effects of a variety of 
experimental variables. These include: list length, presentation rate and 
confidence ratings in experiments involving a discrete, independent trial 
procedure; and mode of rehearsal, size of stimulus set, and judgements of 
recency in continuous memory experiments. In all of these experiments, 
items to be recalled were presented for a single period of study with a 
subsequent test period. In so far as a given item never received more than a 
single study period (or reinforcement), these experiments may be said to 
involve memory rather than learning. The present study extends the earlier 
work into the area of paired-associate learning by giving items varying 
numbers of reinforcements. An attempt will be made in this paper to 
explain the results with a model incorporating only a minor extension of the 
theory used in the previous work. ' 

The task employed in the experiment involves a continuous presentation 
technique which makes it possible to study the learning process under conditions 
that are quite uniform and stable throughout the course of an experiment. 
(This technique is very similar to one employed by Yntema & Mueser (1960, 
1962).) In essence the task requires the subject to keep track of the randomly 
changing responses to eight different stimuli. ‘The eight stimuli are chosen 
at the start of a session and used throughout that session. A session is begun by 
presenting for study each of the eight stimuli with associated responses. 
RE EES যা ন 
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Following this initial study phase, there is a continuous series of trials, each 
trial consisting of a test phase followed by a study phase. During the test phase 
a stimulus is randomly selected from among the set of cight stimyli, and the 
subject tries to recall the response last, or most recently, associated with 
stimulus. Following the test (and the subject's attempted recall) the study 
Phase of the trial occurs. In the study phase, the same stimulus presented in 
the preceding test phase is presented, sometimes re-paired with the response 
that was previously correct, or sometimes paired with a new response; in any 
Case, the subject must study and try to remember the presented pair. In order 
to distinguish between a particular stimulus and a particular stimulus-response 
Pair, a convention is henceforth adopted that an ‘item ’ will refer to a particular 
stimulus-response pair. The number of reinforcements for a given item is 
determined Probabilistically but never exceeds four. ye 
‘The number of trials intervening between the study and test of a given 
item will be referred to as the ‘lag ’ for that item. ‘Thus, if the test occurs 
immediately following the study period, the lag is zero. If one trial intervenes 
(involving test and study on another stimulus), then the lag is one; and 8000) 
Since the stimulus tested is chosen randomly from the set of eight stimuli on 
each trial, the lag between study and the next test is distributed geometrically 
with a parameter of 1/8. The task of the subject is simply to remember the 


current responses assigned to the eight different stimuli. 
because at the time of testing 


multiple reinforcements, i.e. a 
second, third or fourth reinfo 
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Before proceeding to the model it may be helpful to illustrate the experi- 
mental procedure in some detail. Fig. 1 presents a sample sequence of trials 
from trial n to trial n+20. The stimuli were selected from the set of two digit 
numbers, and the responses were selected randomly from the alphabet. Various 
events of the type to be considered later are illustrated in Fig. 1. On trialn+2 
stimulus 22 is paired with a new response, L, and assigned three reinforcements, 
the first occurring on trial n+2. The second reinforcement occurs on trial 
n+3 after a lag of zero. After a lag of six, the third reinforcement occurs on 
trial n-+10. After a lag of eight, stimulus 22 is re-paired with a new response 
on trial n+19. Stimulus 33 is sampled for test on trial n+6 and during the 
study phase is assigned the new response, B, which is to receive two reinforce- 
ments, the second on trial n +9. Stimulus 44 is tested on trial n+4, assigned 
the new response, X, which is to receive only one reinforcement; thus, when 44 
is presented again on trial n+ 16 it is assigned another response which by chance 
also is to receive only one reinforcement, for on the next trial 44 is studied with 
response Q. Note also what constitutes a correct response. For example, 
when stimulus 33 is presented for test on trial n+ 6, the correct response is TS 
when presented for test on trials n+9 and n+15, the correct response is B; 


and when presented again, the correct response will be N. 


2. MoDEL 


‘The theory postulates three memory states: a very short-lived memory 
system called the sensory register; a temporary memory state called the short- 
term store (STS); and a more permanent memory storage state called the long- 
term store (LTS). A consideration of the time intervals and procedure used 
in the present study leads us to assume that every item is accurately recorded 
in the sensory register and then at once transferred to STS for active considera- 
tion by the subject. In what follows, then, the sensory register plays no part 
and attention will be restricted to STS and LTS. 


Short-term Store 


Information entering STS resides there for a short period, perhaps of the 
order of 15 sec., before being lost completely; this period may be extended 
indefinitely, however, by processes controlled by the Subject, such as rehearsal, 
coding, and so forth. The memory in the model is imperfect, nevertheless, 
mited amount of information may be maintained in STS 
at any one time. An immediate test of any information that has entered STS 
(within a few seconds of study, say) will result in correct retrieval of that 
information. In the present study every item is assumed to enter STS and a 
test at lag zero occurs within several seconds of study 5 therefore, the probability 
of a correct response for any item tested at lag zero is unity. For lags of one 
or greater the time intervals in the present task are such that items will be lost 
from STS unless some sort of rehearsal mechanism is invoked to maintain them. 


because only a very li 
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An analysis of subjects’ teports and a consideration of similar experiments 
(Atkinson, Brelsford & Shiffrin, 1967) lead us to Propose a very specific, quite 
orderly, rehearsal scheme called the ‘ rehearsal buffer ° or ‘ buffer °. 


Rehearsal Buffer The buffer is a rehearsal scheme in which a fixed number, 7 | 
of homogeneous items are rehearsed at any one time during an experimenta 
session. It is assumed that the series of study items at the start of each expert 


: a ole td 
+ an item currently undergoing rehearsal is eR 
to make room for the new item). It is assumed that a correct response is alway’ 

given if an item is in the buffer at the time it i. 


is, it is necessary to distinguish theoretically between 
-items and N-items. An O-item 
Y Whose stimulus is already a component 


item (new-item) is a Presented item 
uffer. 


There are several reasons for this rule. If the Presented item is receiving 


the subject bases his decisions 
ediately Preceding test phase of the trial: if on the test 

{ store the item being presented 
Otherwise, the item is entered 


em currently in the b babilit. 
1/r of being the one eliminated). OU Val le MELE 


If a correct retrieval 
» then the subject has 


" it, and will not attempt to rehearse it 
l » Of course, the Presented item is receiving its second, third 
Or fourth reinforcement). Note that a i 
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item (and not just a correctly-guessed response or a zero-lag response) is necessary 
for the subject to prohibit entry into the buffer. If the presented N-item is 
not retrieved from LTS (which is always the case if the item is being given its 
first reinforcement) then the subject does not ‘ know ° it, and rehearsal becomes 
desirable. On the other hand, there undoubtedly is a certain amount of effort 
involved in rearranging the buffer; in addition, the item which must be removed 
from the buffer to make room has not yet been tested. In consideration of these 
conflicting tendencies it seems appropriate to let the entry probability be a 
parameter, «, to be estimated. Finally, once an item is entered into the buffer, 
it is necessary to decide which item already there is to be removed. In previous 
studies using fixed lists, it has proved useful to postulate a tendency for the 
oldest items in the buffer to be the first eliminated. In all work using the 
continuous presentation technique of the present study, however, it has been 
most accurate to postulate a random choice of the item to be removed. Fairly 
direct evidence supporting this assumption will be presented later in conjunction 


with Fig. 2. 


Long-term Store 


LTS is viewed as a memory state in which information accumulates for 
each item. ‘The term ‘information’ is not used here in a technical sense. 
It refers to codes, mnemonics, images or anything else the subject might store 
that would be retrievable at the time of test. It is assumed that information 
about an item may enter LTS only during the period that an item resides in 
the buffer. The status of an item in the buffer is in no way affected by transfer 
of information to LTS. Whereas recall from the buffer was assumed to be 
perfect, recall from LTS is not necessarily perfect and usually will not be. 
At the time of test on an item, a subject gives the correct response if the item 
is in STS but if the item is not found in STS, the subject searches in LTS. 
This LTS search is called the retrieval process. ‘Two features of the LTS 
retrieval process must be specified. First, it is assumed that the likelihood of 
retrieving the correct response for a given item improves as the amount of infor- 
mation stored concerning that item increases. Second, the retrieval of an item 
gets worse, the longer the information has been stored in LTS. This could 
result from autonomous decay or active interference from other information 
being stored in LTS. 

It will be specifically assumed in this paper that information is transferred 
to LTS at a constant rate, 0, during the period that an item resides in the buffer; 
0 is the transfer rate per trial. ‘Thus, if an item remains in the buffer for exactly 
J trials, then that item accumulated an amount of information in LTS equal to 
0. Next, it is assumed that each trial following the trial on which an item is 
knocked out of the buffer causes the information stored in LTS for that item 
to decrease by a constant proportion 7. Thus, if an item receiving its first 
reinforcement were knocked out of the buffer at trial f, and i trials intervened 
between the original study and the test on that item, the amount of information 
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stored in LTS at the time of test would be (j6) (77-7). In this experiment an 
item receiving two, three or four reinforcements may enter and leave the SL 
two, three or four times. When the item is in the buffer the 0-process 1s 
activated, and when not in the buffer the 7-process takes over. 


i ich 

A Correct response for an item un 
z HN Ne oe ন 

amount of information in L'T1'S equ 


ত ER Fh j » retrieval 
to I at the time of test. Considering the above specifications on the retriev 
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Where g is the Suessing probability and, in the Present experiment, is 1/26 sin 
there were 26 response alternatives. 


becomes fairly complex. 
tj trials, then out of the b 
tested. At that point 


nations of 1’s and j’s, and 

It should be clear that this 

culating predictions, although possible, is both formidable 
» Predictions from the model were Senerated using 

hods; the procedure is described in detail later. 

it would be Appropriate to review the sequence of events and 

decisions Proposed in the model. A trial begins with the Presentation of a 


: The subject first checks STS; if the test was zero lag then 
the item is present in STS i 


with a response for study. 
the subject refers to the t 
N-items). It should be no 


7, the long-term decay Parameter. The model is identical to that used in 
Previous studies (cf. Atkinson, Brelsford & 
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that an item retrieved from LTS is never entered into the buffer, a state of 
affairs never arising in experiments where each item receives only one reinforce- 
ment. 


3. EXPERIMENT 
Method 


Subjects ‘The subjects were 20 students from Stanford University who received §2 


per experimental session. Each subject participated in at least 10 sessions. 


Apparatus The experiment was conducted in the Computer-Based Learning Labora- 
tory at Stanford University. Control functions were performed by computer programs 
running in a modified PDP-1 computer manufactured by the Digital Equipment Corpora- 
tion and under the control of a time-sharing system. ‘The subject was seated at a cathode- 
ray tube display terminal. Stimuli were displayed on the face of the cathode-ray tube 
(CRT); responses were made on an electric typewriter keyboard located immediately below 
the lower edge of the CRT. (For amore detailed description of the apparatus see Atkinson, 
Brelsford & Shiffrin, 1967.) 

Stimuli and responses ‘The stimuli were eight two-digit numbers randomly selected 
for each subject and session from the set of all two-digit numbers between 00 and 99. 
Once a set of eight stimuli was selected for a given session, it was used throughout the 
session. Responses were letters of the alphabet, thus fixing the guessing probability of a 
correct response at 1/26. 

Procedure Every session began with a series of eight study trials; one study trial for 
each stimulus used in the session. On these trials each of the eight stimuli was paired with 
a response selected randomly. ‘There were no restrictions on repetition of responses. 
After the initial study trials the session involved a series of consecutive trials, each consisting 
of a test phase followed by a study phase. On each trial a stimulus was randomly selected 
for testing, and the same stimulus was then presented for study. During the study phase 
of the trial the stimulus was sometimes re-paired with a new response and sometimes left 
paired with the old response. To be precise, when a particular stimulus-response pair was 
presented for study the first time, a decision was made as to how many reinforcements 
(study periods) it would be given ; it was given either one, two, three or four reinforcements 
with probabilities 0°30, 0°20, 0:40 and 0°10, respectively. When a particular stimulus 
response pair had received its assigned number of reinforcements, its stimulus was then 
randomly re-paired with a new response on the next study trial, and this new item was 
assigned a specific number of reinforcements using the above probability distribution. 
Reference to Fig. 1 should clarify this procedure. ‘The subject was instructed to respond 
on the test phase of each trial with the letter that was last studied with the stimulus being 
tested. Since the stimulus selected for testing was chosen randomly on each trial, the 
distribution of the lag from study to test was geometric with a parameter of 1/8. 

The temporal arrangement for the eight initial study trials was such that each study 
trial lasted for 3 sec. with a 3 sec. inter-trial interval. Each trial of the session proper 
involved the following sequence of events: (1) The word test appeared on the upper face 
of the CRT. Beneath the word test a randomly selected member of the stimulus set 
red. ‘This test portion of a trial lasted for 3 sec., during which subjects were told to 


appea: j 
respond with the last response that had been associated with that stimulus, guessing if 
necessary. (2) The CRT was blacked out for 2 sec. (3) The word study appeared on the 


CRT for 3 sec. Below the word study a stimulus-response pair appeared. 
the same one used in the preceding test portion of the trial. Depending 
t schedule, the response was either the one that had previously been 
(4) There was a 3 sec. inter-trial interval before the next trial. ‘Thus 


upper face of the 
The stimulus was 
upon the reinforcemen 
correct or anew one. 
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i ials during 
a complete trial (test plus study) took 11 sec. A subject was run for 220 hich En 
each experimental session. For each subject and session the entire sequence of p 
tions, the subjects’ responses, and response latencies were recorded. 
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Over all sessions and subjects. 


1.0 


[) 


7-0 - 3 REINFORCEMENTS 
7 -O- - 2 REINFORCEMENTS 
7 -9-- | REINFORCEMENT 
me’ THEORY 


PROBABILITY OF A CORRECT RESPONSE 


LAG 
FiGURE 2. Observed and predicted probabil 


ities of a correct response as 
for items tested following their first, s 


a function of lag 
econd or third reinforcement 


Fig. 2 presents th as a function of lag for 
» Second and third reinforcements. The number of 


» And also toward the smaller 
S have more observations because 
of reinforcements are 


after a lag of nm trials. ‘The data are 
for J equal to 1, 2 and 3. 

ern. The probability correct 
On lag 1 the greater the 


Multiple Reinforcement Effects in Short-Term Memory 9 


number of prior reinforcements. The forms of the l-reinforcement curve and 
the probability values are quite similar to those found in a previous study with 
a similar procedure (Atkinson, Brelsford & Shiffrin, 1967) in which only one 
reinforcement was given to each item. There is one immediate inference that 
can be derived from these curves: accepting for the moment the rest of the model, 
the assumption that items to be lost from the buffer are chosen randomly is 
reasonably accurate. If there were a sizable tendency for the oldest items in the 
buffer to be lost first, then the curves would exhibit a pronounced S-shaped 
effect (Atkinson & Shiffrin, 1968). 

For a deeper analysis of this experiment, certain dependencies masked 
by the curves of Fig. 2 need to be considered. For example, the probability 
of a correct response to an item that received its second reinforcement and was 
tested at some later trial will depend on the number of trials that intervened 
between the first and second reinforcements. To clarify this point consider 
the following diagram: 


lag a lag b 


22-7 lm—>?| 22 22-7 mmm] 22 


(1st study) (1st test) (2nd study) (2nd test) 


Item 22-Z is given its first reinforcement, tested at lag a and given a second 
reinforcement, and then given a second test at lag b. For a fixed lag b, the 
probability of a correct response on the second test will depend on lag a. In 
terms of the model it is easy to see why this is so. The probability correct for 
an item on the second test will depend upon the amount of information about 
itin LTS. If lag a is extremely short, then there will have been very little time 
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lag a and lag b. (See diagram in text.) 
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for LTS strength to build up. Conversely, a very long lag a will result ” ও 
strength building up to a maximum but then decaying once the item 
the buffer. Hence the probability of a correct response on the second 
will be maximal at some intermediate value of lag a; namely, at a lag which ul 
give time for LTS strength to build up, but not so much time that SE 
decay will occur. For this reason a Plot of probability correct on the ন 
test as a function of the lag between the first and second reinforcement shou d 
exhibit an inverted U-shape. Fig. 3 is such a plot. ‘The probability SA 
the second test is graphed as a function of lag a. Four curves are shown for 
different values of lag b. The four curves have not been averaged over all ET 
of lag b because we wish to indicate how the U-shaped effect changes with চন 
in lag b. Clearly, when lag bis Zero, the Probability correct is unity and ther 
is no U-shaped effect. Conversely, when lag b is very large, the probability 


correct will tend toward chance regardless of lag a, and again the U-shaped 
effect will disappear. 


5. THEORETICAL ANALYSIS 


Of-fit of the model involves a number g 
d experiment, such desirable statistics as 
€ parameters cannot be calculated; indeed, 
exact data sequence which will allow us to 


€ 
JPe of least-square De 
i-square. The criterion will 


not be distributed as chi-square.) Given a 
goodness-of-fit measure, we generate predictions for 
€e model is too complex 

te Carlo techniques are 
used. ‘The Monte Carlo Procedure involved Senerating pseudo-data on a 
computer. These data w 
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the model (and the proced jhent) Ml EE 
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analysis routine that was applied to the real data to generate Figs. 2 and 3 was 
applied to the pseudo-data to generate theoretical predictions. A large sample 
of pseudo-data was generated so that the theoretical predictions were stable and 
accurate. Having generated theoretical predictions for a given set of parameter 
values, the goodness of fit was then measured by the x* criterion: 


Fe 1 1 : 
x(n, 7, 0 0, 7) JR 5) * মম ত্য {Nec a) , 02) 


Where the sum is taken over all data points in Figs. 2 and 3. For the actual 
data, the observed number of correct responses for the ith point is denoted by 
Os, and Ni is the total number of responses. 

For the Monte Carlo data, Pr(Ci) denotes the probability of a correct 
response; of course, the value of Pr(C;) depends upon the parameter vector 
(7, «4, 8, 7). We denote by n the number of simulated trials upon which Pr(C;i) 
is based. As usual, the smaller the x? value, the closer are the predicted and 
observed data. 

In order to determine which set of parameter values provides the best fit 
according to the x criterion, a fairly exhaustive search of the parameter space 
was undertaken. (The grid search procedure is similar to that described in 
Atkinson & Crothers (1964).) Initially the search was carried out using 4,000 
different values for the parameter vector (7, o, 0,7), with a xX* value being 
generated for each vector. For that parameter vector yielding the minimum 
x2, a second, converging, search routine was then begun. These converging 
searches were continued until the set of parameters generating the minimum 
x2 was determined to two decimal places. One problem that arises when using 
a Monte Carlo procedure is determining how large a sample of pseudo-data is 
required to ensure accurate predictions. In this case, an attempt was made to 
ensure accurate predictions by generating a very large amount of pseudo-data 
for each set of parameter values: in the initial grid search each set of parameter 
values were given 200 subjects worth of pseudo-data (each subject consisting of 
2200 pseudo-trials). Asa check on the stability of the predictions, the pseudo- 
data were divided in half (100 subjects each) and separate Xs computed for 
each half [x2(1) and x°(2)]. The following inequality was then evaluated: 


XA(D) 
-95 < 25 € 1-05. 

0-95 ত 1-0 (3) 

Tf this inequality was satisfied then ¥X2 was computed for the entire 200 subjects 
worth of pseudo-data, and the value was assumed to be appropriate for that 
point in the parameter space. If the inequality failed, then the number of 
seudo-trials was doubled and the split-half test again applied. The number 
of pseudo-trials was increased in this manner until the inequality was satisfied. 
Furthermore, as the grid search began to converge on the best set of parameter 
values, the bounds of inequality (3) were narrowed so that even greater accuracy 
of predicted values would result, When the final set of parameter values was 
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Obtained, 12,500 Monte Carlo subjects were generated using those EE 
Values. In all Subsequent discussions, the predicted values are ed os 
output of this final Monte Carlo run, and it is doubtful that they re RTE 
fluctuations due to sampling error. The best set of parameter values v 
7=3; =0.65; 0=1.25; += 0-82. | ie ont 
These parameter values are in reasonable accord with those foun 


. :d to be 
buffer size of 7=3 may seem small at first glance but 7 has been estimated to 


i ; i ch 

Considering that this decay factor is applied টী old 

s One explanation for LTS strength being reduced so quickly woul hen 
i retroactive interference in this situation. If so, 


Evidence along these lines will be 
presented shortly. 


The predictions from the theory are shown as the smooth curves in Figs. 2 
and 3. Tt should be evid i 
Observed ones. Note als 


nate an item from the buffer. Similarly, the et 
Probability Correct, because ie 
buffer. Although Oitems and Ate 
কা » the probabili i rrence € 
be maximized by selectin ff apeen ee EE St 


‘“all-same ’ and all-different ? 
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FicurE 4. Observed and predicted lag curves for the all-same and all-different conditions. 


its first, second or third reinforcement. ‘The model predicts that once any 
of the intervening items enters the buffer, every succeeding item will be an O-item 
and hence will also enter the buffer (because all these intervening items have the 
same stimulus). Thus the all-same curve should decrease less (as a function of 
lag) than the unconditional lag curves presented in Fig. 2. For the all-different 
curves, on the other hand, the probability of a correct response is computed for 
instances when every intervening item between study and test utilizes a different 
stimulus. In this case, the number of intervening N-items tends to be maxi- 
mized, and hence the probability that the studied item will be knocked out of the 
buffer tends to be maximized. ‘Therefore, the all-different curves should 
decrease faster than the unconditional lag curves. It will be seen in Fig. 4 
that the all-same and all-different curves conform to these predictions. The 
solid lines in the figure represent predictions from the model using the parameter 
values estimated in the previous section; the correspondence of data and theory 
appears to be reasonably close. 

Next consider the factors determining the probability that a presented 
item will enter the buffer. Most important is the probability that the presented 
item is an O-item, since an O-item’s stimulus is already in the buffer, and thus 
every O-item enters the buffer. Because a high probability of entering the 
buffer implies a high probability of a correct response at test, it should be possible 
to manipulate the probability correct by manipulating the probability that a 
presented item is an O-item. In Fig. 5 this has been done. Consider a 
sequence of consecutive trials all utilizing the same stimulus, but with the last 
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item in the sequence being given its first reinforcement (thus its লৰ নও 
different from that of the immediately preceding item). Once any i 0 Tonge 
sequence enters the buffer, every following item will do Er) ৰব! us ৰ i 
the sequence, the greater the Probability that the last item in t ie Sa ER 
enter the buffer. Fig. 5 plots the Probability of a correct response as a 
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NUMBER OF PRECEDING ITEMS WITH THE SAME STIMULUS 


FiGURE 5. Observed and predicted probabilities o 


ls f the 
f a correct response as a function 0 
number of consecutive preceding ite 


‘mS using the same stimulus. 


. 255 
of the preceding Sequence increases. Sever ; 
en for long lags, where 


the following diagram 
lag a lag b 
22-2 mre 3 22 22-X mms] 22 
(study) (test) (Study) (test) 
Item receives k 
its jth enn oR 
reinforcement 


ie cial 22 
then tested at lag a; on this trial is 
ed next at lag b. According to t 
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theory, the shorter lag a, the better performance should be when the item 
is tested after lag b. ‘This prediction is based on the fact that the more recently 
a stimulus had appeared, the more likely that it was still in the buffer when the 
next item using it was presented for study. If the stimulus was in the buffer, 
then the item using it is an O-item and automatically enters the buffer. In the 
present analysis, we examine this effect for three conditions, i.e. the preceding 
item using the stimulus in question could have just received its first, second or 
third reinforcement. Fig. 6 presents the appropriate data. In terms of the 


T T T 
er 2 REINFORCEMENTS 


| REINFORCEMENT 


er —-0-— DATA 
THEORY 


PROBABILITY OF A CORRECT RESPONSE 


FIGURE 6. Observed and predicted probabilities of a correct response as a function of lag a. 
(See diagram in text.) 


above diagram, what is plotted is the value of lag a on the abscissa versus the 
probability of a correct response averaged over all values of lag b on the ordinate. 
‘There is a separate curve for f= 1, 2 and 3. Note that the results of Fig. 5 are 
confirmed by those of Fig. 6; again interference theory would appear to predict 
an effect opposite to that found. 

The predicted curves are based upon the previous parameter estimates. 
The predictions and observations coincide fairly well, but the effect is not 
as dramatic as one might hope. One problem is that the predicted decrease 
is not very large. Considerably stronger effects may be expected if each curve 
is separated into two components: one where the preceding item was correct 
at test and the other where the preceding item was not correct. In theory the 
decrease predicted in Fig. 6 is due to a lessened probability of the relevant 
stimulus being in the buffer as lag a increases. Since an item in the buffer is 
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always responded to correctly, an analysis made conditional upon Co 
responses or errors (the centre test in the above diagram) should magnify গম 
effect. ‘To be precise, the decrease will be accentuated for the curve SOnTItOnG 
Upon correct responses, whereas no decrease at all is predicted for the curve 
conditional upon errors. If an error is made, the relevant stimulus cannot be 
in the buffer and hence the new item enters the buffer with probability «, 
Which is independent of lag a. Fig. 7 presents the conditional curves and the 
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is fairly evident for the correct curves, 
» Are quite flat over lags. Conceivably one 
to item selection with Correct responses 
responses indicating more difficult ones. 


item selection could explain the eventual 
trect and error Curves found in eac. 

c dly t € Some selection effects i i 
magnitude is difficult to determine. Thus eh rl wh ans 


to correctly at a long la; ent, however. An item responded 
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short and long lags, with the effect at long lags (when LTS rather than the 
buffer may be assumed to be the predominating influence) exhibiting the result 
expected from interference theories. 

Figs. 6 and 7 both show that the predicted decrease becomes smaller as 
the number of reinforcements increases. The fact that the data seem to support 
this prediction helps confirm certain of the buffer-replacement assumptions. 
The decreasing effect as reinforcements increase is predicted because the 
probability of entering the buffer is reduced for an item receiving its third 
reinforcement; this occurs as a consequence of the assumption that an item 
retrieved from LTS on the test phase of the trial is not entered into the buffer 
on the study phase of that trial. ‘Thus as reinforcements increase, the probability 
of being in the buffer as a result of short lag a is partially counterbalanced. 

By and large, the data and predictions to this point may be considered to 
provide fairly strong support for the details of the model. The feature that 
has been left out of the model is that of LTS response competition, or negative 
transfer. ‘The model fails to take account of this effect because it ignores 
residual information in LTS from previous items using the same stimulus. 
This lack is most clearly indicated by the occurrence of intrusion errors; par- 
ticularly, errors that were correct responses on the preceding occurrence of that 
stimulus. For example, consider the following sequence: 


lag a lag b 
22-Z lmm—>| 22 22-X mm—?| 22 


(study) (test) (study) (test) 
Item receives Assignment 
its jth of new 
reinforcement response 


Item 22-Z is studied for the jth time and then tested at lag a. On this trial 
22 is paired with a new response X and next tested at lag b. By an intrusion 
error we mean the occurrence of response Z when 22 is tested at the far right of 
the diagram. ‘The model predicts that these intrusion errors will be at a chance 
level (1/25) and independent of lag and number of reinforcements. In fact, 
these predictions fail. Fig. 8 presents the probability of intrusion errors as a 
function of lag b, where the data are pooled for all values of lag a. Three curves are 
plotted for f=, 2 and 3. If itis assumed that the probability of giving the 
previously correct response as an intrusion error is some function of that previous 
item’s current LTS strength, then the pattern of results in Fig. 8 follows 
naturally. For example, the more reinforcements of the previous item, the 
greater its LTS strength, and the greater he probability of its response being 
given as an intrusion error. Thus this failure of the model is not very 
distressing, rather it was expected. The model could be extended in a number 
of obvious ways to take account of competing LTS traces without appreciably 
changing the accuracy of the predictions already considered. 
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A RETEST OF THE RESPONSE-BIAS EXPLANATION OF THE 
WORD-FREQUENCY EFFECT 


By JonuN MoRTON 
Medical Research Council Applied Psychology Research Unit, Cambridge 


This paper investigates whether the relation between the frequency of 
occurrence of a word and its sensory threshold can properly be ascribed to 
response bias. Seven models for the recognition process are tested against data 
of Brown & Rubenstein (1961). The models are of two kinds: probabilistic 
single-threshold models and information-processing models. It is concluded 
that the data can only be accounted for within the former class of models by 
assuming a differential effect of the stimulus as well as response bias. For the ! 
class of information-processing models, the data require the assumptions that there 
is equal sensitivity for words of all frequencies and a lower criterion for more 
common words. ‘The relation between the two acceptable models makes it 
apparent that the notions of ‘stimulus effect’ and ‘response effect’ are by no means 
as clear as has previously been thought. 


1. INTRODUCTION 


When individual words are presented acoustically in noise or exposed 
visually for a brief period, the probability of an observer identifying them 
correctly increases with the frequency of occurrence of the stimulus word in the 
language. Itis at present a matter of dispute as to whether this result is due to 
stimulus effects or to response bias (Broadbent, 1967). 

Ultimately it will be shown in the present paper that the rigid dichotomy 
between stimulus and response factors is, in part, a semantic artifact, for it is 
possible to devise information-processing models to which neither of these terms 
can be singly and unambiguously applied. The dichotomy, as it is usually 


employed, may be restated in the following way. 
1. It is possible that high-frequency words are more distinctive than words 


of lower frequency either by virtue of their ‘pure’ stimulus properties or 
because the recognition system is organized in such a way as to enable common 
stimuli to be processed more readily than less common stimuli. 

2.. The response-bias explanation of the word-frequency effect would deny 
the above possibilities, attributing the results entirely to the observer's tendency 
to use high-frequency words more often. Thus in the absence of any stimulus 
information, or if such information were minimal and insufficient for a clear 
‘ percept ’, the observer would respond according to his predispositions, that is 
‘guess’, and would respond preferentially with high-frequency words. 

Brown & Rubenstein (1961) claim that when response bias is controlled the 
word-frequency effect is eliminated. ‘To eliminate the possibility of response 
bias affecting the data they introduce the notion of frequency “intervals, 
কালাদমং্দাহলাদদিদক মম Bureau of Lo 
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5 . ¢ s of words 
interpreting the response-bias explanation as implying নে tengo I 
which differ in word-frequency are random samples from t thar BINA nt 
of acoustical or visual parameters’. This assumption, OG the Frequency 
Acoustical Equivalence, has the force that no information os) Es edly to be 
interval of the word can be gained from the stimulus and any Ww be 1 যা of the 
confused on the basis of its stimulus properties with a word from @ 
uency intervals. . algo 
os They divided up the 6500 monosyllabic content words into 12 Ee a 
500 words each by their frequency of occurrence. Six subjects Ww at two 
presented with 1300 words (100 from each interval i 
speech-to-noise ratios (S/N). 
frequency interval i: 


n random order) Ge 
; r 
‘Three numbers were determined fo 
ci the number of correct responses to stimuli in the ith interval; 
ss the number of responses wi 
the stimulus (this would incl 


ri the total number of respons 


f 
্ that 0 
hose frequency class corresponded to i 
ude both correct and incorrect responses); 


es in the ith interval to all stimuli. 


TABLE 1. Raw Data FRom BROWN & RUBENSTEIN 


(Maximum values of Ci and s; are 600. Maximum value of r; is 7800.) 


S/N =0 db S/N =10 db 
ৰি ও EEE EEE 
¢ 6 Si ri ci si ri 
1 21 34 77 325 332 367 
2 23 28 81 313 318 361 
3 55 68 135 390 394 448 
4 29 45 161 344 355 437 
5 29 39 159 357 371 450 
6 47 62 227 415 424 534 
3 55 76. 255 414 419 522 
ঠৰ 78 114 409 445 460 617 
10 08 145 608 428 447 618 
{ 12 155 674 427 442 657 
12 196 183 924 473 479 742 
13 2 213 1301 415 433 739 
p 2 388 2498 a8 527 883 
361 425 
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the Appendix, In addition, they b 
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error responses which are in the correct frequency interval. This equals 
si—ci. The values of ei at the higher S/N are too small, and are too irregular 
with respect to i to have value in a graph. The relation between e; and ci for 
the lower S/N is depicted in Fig. 1. ‘The data are plotted in terms of proportions, 
and the variables will be treated in this way in the body of the paper. 
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S/N = O db 


Probability of a correct response (ci) 


০° 


0-1 0-2 0-3 


Probability of on incorrect response in the correct 
frequency class (ei 


ce in the recognition of visually presented words of different 


5 [1 ¢ 
Fiend des ence. (Data from Brown & Rubenstein, 1961.) 


frequencies of occurr 


The line drawn is the best-fitting straight line: 
c1=0"033 + 1-526 er; 


and the best-fitting quadratic is: 

c= 0024+ 1-755 ei— 1356 eff. 
A further datum used below is: pv the total number or proportion of error 
responses in the ith interval to all stimuli. ‘This equals ri—ci in terms of 
numbers and ri—ctln in terms of proportions. A plot of ci against pi is given 


in Fig. 2 for both S/N values. 
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Brown & Rubenstein’s data will be used to test a number of সণ" 

/ nition. Some of these models have been explicitly it ET 

কালা ০ ক (see Broadbent, 1967, for references). ‘Theory E, however, 
las na previously been examined in any detail. 
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Probability of a correct response (ci) 


S/N: 0 db 
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Probability of an incorrect response over a!l stimuli (p,) 
FIGURE 2. Recognition performance with words of 


a | REE 
different frequencies of occur 
(Data from Brown 


NK Rubenstein, 1961.) 
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The first group of models examine 
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d are single-threshold models. Ace 
Ing to these, the subject guesses in the 


absence of a percept. 


Theory A. Either a word is r 


x ন a H > the 
ecognized or it is not. If it is not, the 
Observer guesses according to his dispositions which are independent 0 


a ্‌ তপ Ing 
stimulus. By the response-bias theory, the Probability of a word ত 
recognized, wx, increases with S/N but is independent of the interval ft. 

Stimulus is not recognized, then the observer EUCsses, the proportion of gu 


3 s z * & bs i - 1 Vacs 
In interval i being gi. Thus With mn intervals and AT words in each inter 
§lven a stimulus in the ith interval, it follow 


essts 


s that: 
1) 
$=%+ (1 —x)gs, ( 
1 (2) 
1 (3) 
ণ=(1- 1) 
Therefore, eliminating £1 from eqns. (2) and (3): 
1 (4) 
Bie Ss etx, 


M-1 
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“Also, given stimuli equiprobable in 1, 


ri=(x/n) +(1-x)gi (5) 
and 
1 
Pi= (1 = ন) (1-x)gi- (6) 
‘Therefore, 
a= [n(nM-— pits. (7) 


Eqn. (4) should correspond to Fig. 1; however, the slope of the data line is ie 
steep (since in the equation 1 = 500 and n= 13). Likewise, the slopes predicted 
for the relations depicted in Fig. 2 from eqn. (are too. SEND; 'SVUOD: SRSUTNE 
linearity. ‘I'he simple theory is therefore incorrect. 


Theory B. If the stimulus contains information relating to its own class 
then the proportion of responses in the correct class increases. ‘This affects the 
second term on the right-hand side of eqns. (1), (2) and (3), but leaves eqn. (4) 
and the slope of eqn. (7) unchanged. The modification of Theory A must 
therefore also be abandoned. 


Theory C. It is possible, and intuitively obvious, that on a number of 
occasions, while insufficient stimulus information is available to enable the 
Observer to recognize the stimulus, there will, however, be enough to enable him 
to make a choice from a restricted set of alternatives. © Thus if the initial phoneme 
It/ is heard clearly, only words beginning with ‘tt’ will be considered by the 
observer. By the theory of acoustical equivalence, however, this restricted set 
of alternatives will be equally distributed among the separate response intervals. 
ie forms of eqns. (1) and (5) will therefore remain identical. Eqn. (2) will 

ecome: 


1 
t= 24+ — — X)St 2a 
cG=4+ 0 XN)gt, (2a) 


where mis the number of words in the reduced set in each interval. ‘Thus eqn. 
(4) becomes: 


1 
a= ut (4a) 


m- 
and eqn. (7) becomes: 
ci= [n|(nm—1))pit+t x. (7a) 

In eqns. (4 a) and (7 a), m has a lower limit of unity, since it refers to the 
number of possible words in the correct class in the original definition of ci; the 
occurrence of min the last two equations is determined entirely by this definition. 
Clearly, in the limit the strict definition of the principle of acoustical equivalence 
breaks down, for it 1s possible that onty two words are possible responses, one of 
which will be the correct one. ‘The modified principle would state that the 
other word is on average equally likely to come from any other interval. This 
limiting modification does not, however, affect the equations. 
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Given that m>1, the slope of the function defined by eqn. (7a) has a 
maximum of 13/12=1-083. If the (unlikely) assumption is made that the data 
in Fig. 2 are indeed linear, the equations of the best-fitting straight lines are: 


€1=5:09 p;+0-56 (for S/N =10 db), 
01=1:29 pi +005 (for S/N =0 db). 


Both these lines have slopes in excess of the permitted maximum and model C 
is thus untenable on the basis of Brown & Rubenstein’s data. 


Theory D. A combination of models B and C still yields eqn. (4 a) and a 
form of eqn. (7 a) with the same slope. This theory cannot be accepted either. 
It would appear, then, that threshold models which deny the stimulus any role 
in the word-frequency effect cannot account for the data. 


counted for quite satisfactorily. It will be 
accepted, then, that the Proportion of correct responses resulting from the 
stimulus alone, x, is a function 0 


otha he A Kk 1 as well as of S/N, initially 
assuming that the stimulu nas to its interval and that the 


obtained: The following equations are then 
0=4%i+ 1 
(=H+ (1 -xi)gt, (8) 
&a= [1- i MU 9 
M X1)81, (9) 
mn=(1-_ 
% টী 5) (1-X)g,, (10) 


where X= চ i 
হে (un). From these equations the following can be derived: 


A= TE eta, mn) 
; 
ci=p, Al - x) 
“M(-¥)-( x) i 
Since x; is now a function of 
fssumption that M is small, and thet Jey represent Fig. 1 without the 
longer apply. Equally it is boro 
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If x; is eliminated from the modified eqns. (2) and (3), then 
M-g 


a=l-u HT (13) 
Thus 
eM 14 
#= ECM’ at) 
If Mis large, we may simplify and rearrange this equation to obtain: 
gi=eilll—c). (14a) 


Values of gi were calculated from this equation and are tabulated in Table 2 (a). 
If the model were accurate, the sum of the values of gi plus go, the probability of 
producing a response outside the classification system (i.e. blanks, partial 
responses and polysyllabic words) should be unity. This value is, in fact, 
exceeded at both S/N. This is not a consequence of the approximation, since 
the effect of reducing the value of M in eqn. (11) is to increase the calculated 
values of g for given values of ci and ei. The values of g being too high could 
result from the values of either ci or ei being too high compared with the values 
of the other. One possible explanation of this is that words do carry some 
information concerning their frequency of occurrence which would have the 
effect of increasing ei relative to ci. A second anomaly in Table 2 (a) is that 
values of g for high frequency words (7=10 to 13) are proportionally lower at the 
high S/N. These two points will be discussed below. 


TABLE 2. VALUES FOR gi CALCULATED FROM (a) EQN. (14a), (b) EQN. (15) 


(a) #i=eilll-a) @) ai=pill-a) 

CC EE 

i S/N=0 S/N=10 S/IN=0 S/N=10 
1 * 0:023 0-026 0-008 0-016 
2 0-009 0-017 0-009 0-019 
3 0-024 0.019 0-012 0-023 
4 0-028 0-043 0-020 0-036 
5 0-018 0-058 0.019 0:036 
6 0-027 0-049 0-027 0:046 
Vl 0:038 0-027 0-030 0-042 
8 0-069 0-097 0-049 0-067 
9 0-081 o1ll 0-074 0-074 
10 0-108 0-087 0-084 0-089 
11 0-122 0:047 0-118 0-104 
12 0°178 0-097 0-173 0-126 
13 0-408 0-354 0:323 0-154 
1133 1-032 0:946 0:834 
oes 0-054 0-166 0-054 0-166 
1-187 1-198 1:000 1:000 


Sum 
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ন = ()gi + (8) and (10). The 
i then ci= xi, and pi=(1-— X)gi from eqns ( t 
Re. A edn be astimated from the average number correct and we may 
write: 


81=2l(1 —C), qs) 


where C=Yciln. These estimates of g are given in Table 2 ds br পৰা 0 
eqn. (15) ensures that these values sum to unity. As with I able 2 (a), the values 
of gi for higher values of i are consistently lower at the higher S/N. 


3. INFORMATION-PROCESSING MoDELS 


The above models have as their basis the probability of certain 
taking place. If instead we regard word recognition in terms of কপাৰ 
Processing we obtain different equations, use different parameters, and disco 
different implications for the notions of stimulus effect and response effect. 

The basic unit to be considered is one which accumulates evidence from 
the stimulus. When more than a certain amount of evidence is available, the 
Word is recognized. On the assumption that all the units are inherently noisy, 
We can treat them in terms of signal-detection theory (Green & Swets, 1966). 
The two relevant parameters are then the i 


available, 4’, and the thresholds or criteria of 


Theory F. A model which Ww 
in the introduction, would Say tha 
of occurrence of the word 
Perception (Halle & Stev 
1963) would have to pred 
theories would have the c 


as purely stimulus 
t the value of d’ wa 
as well as of S/N. 
ens, 1962; Liberman, & 
ict such a result. 
riteria dependent o 


Theory G. An alternative model is One in which 


Words but the value of B changes with frequency, 
evidence is required for a high-fre 


based, in the sense described 
Sa function of the frequency 
‘“ Active’ theories of speech 
O0per, Harris & MacNeilage, 
It is Uncertain as to whether these 
n frequency. 


4d’ remains constant for all 
In such a model, less stimulus 


> ডু 
‘This matrix is given in 
ord in the absence of . 
1S a multiplicative factor representing th 
related to @’. he value of w represents a contribu’ 
esponse compared with the aot 
of intervals, The Probability . #50 assumes the acoustic equivalen 
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AT Tesponse is then given by t 


পথ 
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response strengths for that response divided by the sum of all response strengths 
for that stimulus. Given a stimulus in interval i, the proportion of correct 
responses, cr, is given by: 


a 4 
I= (S—-DVi+(MZ+1)’ (16) 


where Z=3Vy (=D... 5 1), which we may more simply write as 
a= oVilT!:.- (16 a) 
‘The proportion of incorrect responses in the correct frequency interval, ei, is 
given by: 
a=[M-DVIIT:- (17) 
From these two equations the following relation is obtained: 
a= eila/(M-—1)). (18) 


TABLE 3. RESPONSE STRENGTHS 


Stimulus Other 
interval Correct Incorrect responses by interval responses Total 
4 a i fue n 
1 aV,  (M-IV; MV; Mn 1 Ti 
i aiVi MV, .. (M-UVi MV 1 7 
n QnVn MV, ঠ MV; . (M- DV 1 Th 


Fig. 1 shows this prediction to be largely upheld. The possible negative 
curvature and the non-zero intercept could be interpreted simply as due to the 
subjects recognizing words which they had never before seen and so could not 
produce as incorrect responses; they would effectively be nonsense syllables. 
Any recognition model would have to treat such words as exceptional cases. It 
might be noted that such an explanation would serve to account for the non-unit 


in Table 2 (a). | | 
ck ন F oe i Fig. 1 would be positively accelerating since o 
would increase as i (and so ci and ei) increased. This prediction 1s emphatically 


rejected by the data. 


4.A COMPARISON OF MopDELS E AND G 
088 i may seem to be in conflict with 


The conclusion that a is constant act : in 
here it was seen that xi had to vary with i. However, 


the result from model Ew j I I 
ted in terms of the amount of information available 


while « can only be interpre j j ! 
from the stimulus, x1 merely specifies the proportion of correct responses which 


30 John Morton 


result from the stimulus. From eqns. (8), (9), (16) and (17) the relations among 
the parameters in the two models are given by: 


(0-1), a 
“= GC) (MZ71)’ 9) 
#= MVAf(MZ +1). (20) 


Thus «x, represents a combination of both stimulus and response strength 
factors and gi is proportional to V;, being specifically the probability of making 
a response in the ith interval in the absence of a stimulus, i.e. when a=. 

If eqn. (20) is summed over i we obtain: 


X= MZ/(MZ +1). (21) 


However, it has already been observed that the sum of the obtained values of g is 


greater than unity. Thus theory G also requires some modification to account 
completely for the data if the explanation follow 


1 
a=+ (1 -m)lyr+( -yded, La 


Where mis the size of the reduced s 

stimulus in the ith interval is not re i 

fr . . 1 
om the stimulus. I € to assume that both m and yi are functions 


alues for g; from the relation: 


#@= mei — yi[(1-— ci)(m — 1)+e] 


23) 
I-yT—-a)m- 1)+ef' ? 


HO EE ¢ 
n this equation, it turns out that Le 
OF a Constant y, which also give positiY 
» if yis increased to reduce the values of gv 


that is, an Unrecognized high-frequency YL 
eS a low-f 
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RE by 5 but does include the size of the reduced set of alternatives m. 
is is 


c= eil af(m -1)]. (24) 
The equivalent of eqn. (20) is the following: 
mV mei (25) 


mZti™ m—DI—e)l+me’ 


This equation must produce positive values for Vi. As 5is increased, the value 
of the function falls, and as mm decreases the value rises: this is analogous to the 
properties of eqn. (23). For any mm, § can be adjusted to fulfil the restrictions on 
the sum of the left-hand side of the above equation (cf. eqn. (21)) without having 
to postulate that 6 varies with 1. 

By either of theories E and G, Brown & Rubenstein’s data can only be 


accounted for by assuming that the stimulus contains information about its 
interval. 


With theory E, it is in addition necessary to propose that such informa- 
tion is greater with high- than with 


low-frequency words. Such a proposition 
could work in practice through partial information and as such would be associa- 
ted with the size of the re 


duced set which was considered above. For example, 
if all that was recognized clearly was the letter 


‘7°, then the set of possible 
responses would be relatively biased in favour of low-frequency words. On the 
other hand, to give a hypothetical example, 


since the necessary statistics are not 
available: if the letter sequence ‘-the-’ were recognized clearly, the set of 
alternatives could be biased in favour of high-frequency words. 

The large change in the relative sizes of the entries in Table 2 at the higher 
S/N, which affects gi in theory E and Viin theory G, requiresa further construct. 
‘The observation amounts to saying that as more stimulus information becomes 
available, proportionately fewer high-frequency words are produced erroneously. 
This could be accounted for in both the theories by postulating that whereas 
the amount of interval information given by a stimulus is greater for high- than 
low-frequency words, the difference becomes less at higher S/N ratios. This 


would apply to either yi or 6 in the two theories. 
He 2 finding would be to assume that the 


An alternative way of explaining this fin ! | $ t 
recognition-system, while requiring less positive stimulus information fora high- 
frequency response to be made, also requires less negative stimulus information 


for such a response to be rejected. For lower-frequency words the system 
would be relatively indifferent to negative information, being only concerned to 
accept positive evidence. Such a proposition would require a far more detailed 


specification of the recognition system to be tested. 


5. CONCLUSIONS 


It has been shown that while a simple guessing model cannot account for 
the word-frequency effect, a more complex one can fit the data. The dis- 
advantage of this theory compared with the one derived from signal-detection 


5 John Morton 
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I ion of more precise data. | 
will have to wait upon the সন en G is such that it is impossible to 
ee A লাল কাঁ either to ‘stimulus effects ’ or “ response effects °. 
হ্‌ শে 2 Cu Y 
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neo ; i : the same internal event for a 
i imulus information to produce the ss L ol 
7 কমন 7 Word. Jt is not necessary, 
i than for a low-frequency word. f | 
2 a লক মী System described in theory G 
i f a two-stage Process; the system des : 
“আর ককাকে r effect is concerned. 
i i far as the word-frequency effect is cernec 
is a single-stage process, as 3 Nps aia 
Equally, the discussion as to whether the Word-frequency effect has a LR 
the frequency of experience or the frequency of emission of the v f 
(Rubenstein & Aborn, 1960) is unnecessary. Within the preferred ক 
a distinction becomes irrelevant (Morton, 1964; Morton & Broadbent, ছ 
These conclusions agree with Broadbent (1967), who examined theories A, C, 
and G, concludin ount for his data. 
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Where e/,; is the Probability 
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In fact, the above equation does not follow from the Principle of acoustica 
equivalence. Consider th 
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TABLE 4 
Response interval 
F_ 
1 2 $ 4 Total 
1 w a b € N 
Stimulus 2 d x e fd N 
interval 3 8 Ah y i N 
4 FE, k 1 z N 
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COST, DISCRIMINABILITY AND RESPONSE BIAS 


By PHILIP 'TWITCHELL SMITH 
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non-negative parameter, is defined and 
developed. Four models for the cost incurred in making a psychophysical 
decision are presented; two of these models are based on signal detectability 
theory and two are based on choice theory. In all models, cost is a function of the 
discriminability of the stimuli presented and the observer’s response bias, and it is 
shown that in three of the models the form of this function depends critically on 
discriminability. ‘The models are applied to the analysis of a reaction-time study, 
a vigilance study, and the statistical properties of the consonant systems of four 


natural languages. 


Cost, a unidimensional additive r 


1. INTRODUCTION 


The purpose of this paper is to develop the concept of cost, and to apply 
this concept to the analysis of skilled human performance. The motivation for 
this approach arises from the belief that psychologists have not been sufficiently 
thorough in their application of terms like cost, effort, capacity and efficiency to 
the analysis of complex human performance. This point will be developed, 
with particular reference to psycholinguistics, in later papers. 

The word ‘ cost ’ is used to indicate a universal currency in terms of which 
allof a subject’s actions (or at least an important subset of them) can be evaluated: 
it rests on the intuitively attractive assumption that there is a unidimensional 
parameter that can be attached to every action and which is fundamental in 
determining the bounds of further courses of action. This concept is closely 
related to what some psychologists (James, 1890; Thurstone, 1930; Zipf, 1949) 
have called effort; it is a generalization of the notion of pay-off in signal detect- 
ability theory and utility theory, and it is used in the same spirit as some 
psychologists (e.g. Garner, 1962) have used an information measure. hj 

The basic notion behind the cost concept is that when a human subject 
ision or carries out an action, he does so in terms of a sequence of 
fundamental processes. A cost (real-valued parameter) can be attached to each 
fundamental process independent of all other processes, and the total cost of a 
sequence of fundamental processes is the sum of the costs for each individual 
fundamental process: at the level of fundamental processes, cost is additive. A 
second requirement for a cost function 1s that costs derived from quite different 
systems of fundamental processes shall be commensurable, i.e. cost 1s uni- 


dimensional. | | | 
As an example, consider a desk calculating machine. Although the machine 
ed to carry out addition, subtraction, multiplication, 


can be pre-programm * & hl es 
division and other more complicated operations, usually such a machine 1s 


makes a dec 
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iti ati in rapid 
basically only a device that can carry out repeated AdCition ৰজ +০ হলা এ 
Succession, together with a number of procedures for conv menial 
division, etc. into a series of addition procedures. Thus 1 TR 
Processes of this machine would be the addition Procedure anc EY ns Of 
between successive addition procedures. (There might be severa rs 
transition that should be dealt with separately.) ‘The cost of a comp TE 
culation could then be defined as the Sum of the costs for each addition Le fhe 
Plus the costs for each transition. Identifying the fundamental লা দিস 
system still leaves open the precise measurement of cost: in the EESent কল 
the time it takes to complete the calculation or the wear and tear to the " i 
of the machine would be ‘ natural ° Ways of defining cost if exclusive intere: a 
centred on speed or durability respectively, and both these measures W 


2 oe Ne : fundamental 
satisfy the requirement of additivity under the identification of the fundai 
Processes. However, conversely, 


radically alter the identification of the 


চর টু | amental 
Operations in parallel, the চলল a 
b) a 
measure of cost in terms of wear Le ne (1 
ee H Le ne 
itive measure of cost in terms of tir 


could be proportional ton). 


) 
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2.1. Models based on Signal Detectability Theory 

| it is assumed that each stimulus presented to a 
subject can be assigned a value by the subject along a unidimensional continuum, 
which will be called the decision axis. The subject knows a priori (probably by 
previous experience) the probability distributions of these values for any given 
stimulus. (In general, different presentations of the same stimulus produce 
different values, because of imperfections in the sensory system.) Both theory 
and experiment suggest that these probability distributions are normal dis- 
tributions. When the subject has to judge which of two possible stimuli has 
been presented to him, he is assumed to use a criterion level on the decision axis, 
such that if the value for the stimulus presented exceeds this level, one stimulus 
is reported, and if the value is less than this level, the other stimulus is reported. 


In signal detectability theory 


Probability 


6 Xx 
(decision axis) 


FIGURE 1. Probability distribution of two stimuli along a unidimensional decision axis. 


s and s’ represent the probability dis- 
ft c (x axis) for the stimuli s and $’. ‘The 
subject will respond to the stimulus s’ if the value of the stimulus presented 
exceeds C' on the decision axis, and to stimulus s if the value of the stimulus 
presented is less than C. The particular position of Cis determined by pay-off 
conditions (the rewards and punishments for the various correct and incorrect 


responses). 
For the models of cost in this context, it will be assumed that collecting 


on about the probability distribution does not contribute to the cost 
he computer example this is preliminary program- 
e task begins.) In this section 
le sample of the value 


"Thus in Fig. 1, the curves labelled 


informati 
incurred by a subject. (Int 
ming, which has already been carried out before th I 
the system arising when the decision 1s made from a sing 


of the stimulus presented will be considered. Ee 

There are two possibilities: that making an individual decision costs the 
same no matter what the situation iS, Or else that some decisions are inherently 
more difficult and costly than others. ‘This second possibility could arise as 
follows: suppose that determining the value of the x co-ordinate for a particular 
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stimulus presentation involv 


ring outa. 
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Model 2 This is simpler, but does not make explicit the underlying 
It is assumed that there is a certain critical region 


fundamental processes. 
Outside the critical area 


around the criterion level where extra cost is involved. 
the cost is minimal, i.e. 

E=E, (p>), 

E=E, (p<k; E> Eo). 


rves of cost against stimulus value are shown for the two models in 


‘The cu 
Fig. 2. 


Cost Cost 


E) F------" 


| EE 
> 


C (Criterion level) x শা 


Model 1 
FIGURE 2. Cost functions for models 1 a 


Model 2 
nd 2. 


exact solutions are required, but since 
Its, and since approximate solutions, 
attention will be confined to 


Neither model is easy to handle if 
both models give approximately the same resu 
as will be seen, are most easily handled by model 2, 


this. 
e derived. For model 


Some results about average cost in a given task can b 
2, the average cost is a linear function of the proportion of decisions the subject 
makes that are in the critical region around the criterion level. If this critical 
region is small compared to the standard deviations of the stimulus value 
probability distributions, then to a good approximation the proportion of critical 
decisions = width of critical region (2k) xthe sum of the probability density 
functions at C(Z), i.e. E= El —2kKY)+2RES. If, for a given sensory system, 
it is assumed that k is constant, this gives the result that average cost is a linear 
function of the sum of the probability density functions at the criterion level. 

This leads to some interesting results. Consider two signals, s, s', each of 
whose stimulus value distributions has variance c2. If the means of these 
distributions are separated by a distance d and either signal is equally likely to 
occur, this gives the situation pictured in Fig. lL, with d=dlo. If d’ is 
sufficiently large, the average cost as the criterion level is changed will be less 
for central values of criterion level in between the peaks of the two SUD On 
since the sum of the probability density functions 18 smaller I EE one 
the peaks, and thus less critical decisions on average will be Le Nj a 5 
however, cost will be larger for central v his is illustrated DY 


Fig. 3. 


alues of criterion. 
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in Fig. 3 ralues of d' 
The dip in the cost curve shown in F 1g. 3 (a) appears for kee RS 
greater than 2 (for proof, see Appendix 1). This result has ee rl an 
that for easy discriminations (large 4d’) it involves less cost to be VER af 
reverse is true for difficult discriminations (small 4’). Table 1 lists § 


i i i carious criterion 
Probability density functions for two equiprobable signals, for various 
levels and values of @’. 


- 


Sum of Probability density functions(E) 


density 
functions 


tuncty/ons 


Criterion level 


Criterion level 

(a) 6) al 
Fe | | 

FiGuURE 3. Sum of probability density functions for two stimuli: (a) separated by large 
(b) separated by small 4’. | 
0 
er LI 

TaniE 1. Sums or PRoBaABILIry Densrry Funcrions or Two EQuiPROBABLE STIMU 
| 
(Bis the value of the Probability ratio at the criterion point C.) 
B 
3 4 

0 0-80* 


bs 0.77 0.55 0:30 0.07 0-02 
js 220 066 059 044 ০0:33 
i 4 0.59 0.57 0:53 0:48 
i ee 048 048 048 0-48 
: 0:37 0.37 0-38 0-39 

0-26 0-26 0:27 0:28 0:30 
# Strictly speaking, this is the limit as d’ 
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function A of S,R,s,r,t for each stimulus s and 


is derived by postulating a 
of making the response r given the stimulus 


response r, such that the probability 
sis 
Prfs)<cA(S, R,s, 7, 0). 
This model is made more specific by supposing 
Dlrls)ecA(s, LU) 0); (1) 


sponse r given the stimulus s is proportional 
and the identification function t only: the 
ve responses to r in R are considered 


i.e. the probability of making the re 
to a number determined by s,7, 
alternative stimuli to s in S and the alternati 
irrelevant to determining the size of A(s, r, dl). 

The proportionality sign in eqn. (1) can be removed by rewriting this 
equation as 
¥ A(s, 7, .) 2 

= 7° 
Mrb)= = AG) 0) 


r'eR 
Eqn. (2) can be made yet more specific by assuming 


Afls,r, )= 1(s, U(r))b(?), 


where 7 is a mapping of SxS into the non-negative numbers, and bisa mapping 


of R into the non-negative numbers. Tisa similarity function: the more similar 
s and s’ are, the larger 1(s, $’) is; and bis a response bias function: the greater the 
bias towards a particular response 7, the greater is b(r). Further assumptions 
are usually made: the most common is that —log 7 is a metric on S XS. 

This is not the place for a critical review of this model: for a more careful 
outline of the assumptions in this model, the reasons for making them anda 
review of experimental work relevant to the model, see Luce (1963). 

For our purposes it need only be noted that (1) the model is probabilistic: 
there is no way of determining on a particular trial what the actual response will 
be with certainty; (2) it is a summary of performance, not a description of it: 
in the present terminology, the fundamental processes are not explicit; (3) the 
complicated notions of stimulus discriminability and response bias are each 
handled by single numerical quantities, and these quantities are combined 
together according to simple arithmetical rules. ‘The models of cost to be 
considered will be concerned (1) with the cost in making choices given a set of 
probabilities, (2) with the cost associated with computing and maintaining bias 


parameters. 


2.3. Response Entropy Model 
st in a choice situation where the response 


s that cost is proportional to entropy. 
er com- 


Model 3 This deals with co 


probabilities are given, and simply postulate h: t alg 
The system is a5 follows. Given an unidentified stimulus s, the receiv! 


putes a set of probabilities {p(rls)} specifying the probability of each response r 
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(the present choice model Predicts what these Probabilities will be le 
specifying the mechanism by which they were Obtained). Itis assumed that t 


he restriction that the set of possible responses has 
For example, suppose p(rils)= 

Dlrsls) =}, and lrils) =0 for i> 3, i.e. if the receiver were required to make a long 
i Ponses in this Situation, 50 per cent of his aon. 

2: his case the receiver would UE 

ecide between 71 and r,. ‘The notion of coin-tossing 
r Neral case by Supposing the receiver has some EE 
of Benerating random digits, When a Particular sequence of digits is generated, 


h . « ider four 
this produces a Signal to emit a Particular response. For example, consider fot 
Possible responses Ti, 7 


213,74, to a stimulus 5, with p(r,ls)=!, Mra) = and 
blrasls) =p(r,s) = » and £ binary random digits. ie 
€ sequence 0, r, after 10, rs a 
binary digits would occur un 
A method for constructing Suc 
uffman method of SONU 
; Abramson, 1963). This leads 
Y Sequence required by this decisio 


722)p() logs p(rs), 


iu at 
Or any probability 4; familiar theorem of Shannon (1948) allows us to say th g 
Y distribution and sequence of a-ary random digits, the averag 

ecision is 


722p()p( slog brs), 


anc 


=y, 
Thus =() Otherwise 
SL Blrhs) = 5s, (J) and Xp l)log p(rs) =o. 
This implies that cost (and he st is the 
same for all ce Teactio time); সা hatte 
responses and stimuli and i )isata Minimum value tha 25 


SE" 
A nde ende sr of respon 
ty of the stimu ent of the number of 


—_—~ == 


Cost, Discriminability and Response Bias 43 


For zero discriminability 1(s, $)=R#0 independent of s and s', thus 


B(rls) =p(r) and 
-সSp()p(r}) log Pls) = 220) log P(r). 


In this case, cost and reaction time are linear functions of response entropy. 

Superficially at least, this accords well with experimental results (e.g. Garner, 

1962, ch. 2). 

Twill be useful to compare the properties of this model 

ing signal detectability model in the previous section. 
To make a direct comparison with the signal det 

two equiprobable stimuli s, $’ to which the appropria 


Taking 1(s, s)= 1($', $°), and writing 


with the correspond- 


ectability model, consider 
te responses are 7 and 7’. 


ss) 0) 
ED) HN) 
that 


it is not difficult to derive (see Appendix 2) 
mean cost =E=} {tog +1) +log(n +0) 


1+2170+02 1+2n0+1* } 
Et | oe 5 | iE tee SBC b 
Ku ন ogn—b | GED C+D) 


and E=0 when b=0 or ow (the cases of maximum 
responses). Moreover, when b=1 (the case of no differential bias between 
responses) Eis a maximum if 7) < 4681 and a minimum if 7 > 4-681 (Appendix 2). 
This leads to dromedary-shaped and camel-shaped curves similar to those 
obtained for models 1 and 2 (see Fig. 4). 


differential bias between 


Cost Cost 


Increasing No Increasing_, tog b Increasing | Increasing _, logb 


“bias bias bias bias No bias 
EET bias 

High discriminability 1>4-681 Low discriminability 1<4-681 

FIGURE 4. Costas a function of response bias for model 3. 


Note that the critical points (for transition between the bimodal and 
unimodal distributions) for the signal detectability models (4’=2) and for the 
choice theory model (1 =4681) are quite closely equivalent. For example, the 
minimum error rate at the critical point on the signal detectability models is 
15:8 per cent and on the choice theory model it is 17-6 per cent, 
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Not only are the shapes of the cost 
models, but the variation of cost with 
models. ‘Table 2 compares cost predi 
Probable stimuli when there is no differe 
error rates: the range of cost predicted 
model, but the two models 


bias curves much the same for the ie 
discriminability is comparable for od 
cted by the two models for two ঢল 
ntial bias between responses, for Ve 
is slightly smaller for the hioioe ie : 
give very similar results for low error poe? RL 
) that the ROC curves predicted by signa de hd 
OrY are also very similar, and one way of HOE 
ery different concepts, the two models are in 


1d’.) 


Cost 
Signal 

Error detectability Choice 

rate theory theory 

(model 2) (model 3) 

0-500 1:67 1:48 
0-401 1.60 1:44 
0.309 1-46 1-32 
0-227 1:25 1-14 
0-159 1-00 0.94 
0-106 0°75 0-72 
0.067 0-54 0-53 


ter’ 
consider a ‘ Yes—No° Choice (Signal vs noi 
Subject receives a sma WE song 
fined for failing to det 
expectation of reward from a long ser 
criterion level is adopte 

“ No’ only when h 
Suggestion is that in this Cc 


(4) 
a ) xampP 
evel involves cost. For ex e 


nd 52)’ 
Ys ‘Yes’ almost all the time, a The 


at no signal was presented. that 


as i ন চ ‘mull 
€ the Subje Urs cost in accepting stim 


Ct inc 


} 
¢ 
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he would ‘normally’ reject as signals. Some experimental evidence can be taken 
as support for this view: Norman (1966, personal communication) reports that 
although subjects in such situations are aware of the necessity for maintaining 
extreme criterion levels, there is a tendency over a series of trials for them to 
recede to a more moderate criterion. 

The fact that choice theory is a summary and not a description of perform- 
ance again means that the fundamental processes are not obvious and only the 
form of the cost function will be proposed. It is postulated that B(r) is the 
‘natural ’ level of b(r) and that for b(r) > Br), cost is a monotonic increasing 
function of b(r)-B(?), and for b(r)<B(r), cost increases monotonically with 
B(r)—b(r). It is difficult to be more specific than this without making dubious 


assumptions. 

Model 4 For explicitness this model makes a few such assumptions. 

(1) In the two-choice case, if b(r) = B(r) and b(7')= BUY’), the cost involved 
in increasing b(r) to kb(r) is the same as that involved in decreasing b(r’) to 
k-tb(r'). (Note that both these changes have the same effect on the set of 
probabilities {p(r|s) }-) This suggests that log b(r) rather than b(r) is a more 
convenient unit with which to measure cost. 

(2) Cost associated with maintaining b(r) at a leve 


natural level B(?) is 
E= {log [b()/BO)]}. 
t function that satis 


1 different from its 


This is a simple mathematically convenien fies all the con- 


siderations outlined so far. 
In the two-choice case, cost E can be taken as 
E= [log (b(r)/B(r))]* + [og (6) /BO)] 

The usual minimization procedures show that for B(r) and BUY") fixed and 

b(r)/b(7')=2 kept constant, Eis a minimum when 

log b(7)=3 log b +1[log B(r)+log B(r')], 
| © log b(r')= —1 log b+ log BO) +log BO), 
i.e. the logarithms of the bias parameters are +} log b and —3 log b away from 
the mean of the logarithms of the ‘natural ’ levels of the bias parameters. In 
this case | 
min E=} {[log b—log B(r) +log B(r')]*}. 

Fig. 5 shows the variation of min E with log pb. Cost is zero when b(r) 
= Bl), br") = BCU") and log b=log B(r)—log B(r'). This model can be applied 
to the problem of distinguishing a pair of stimuli of unequal frequency. In this 
case, if p()=P and pls)=1-P, 

error rate= Ph(r'ks) + (1 - Pps’) 


1 b 
PHP) 
Ro Ft Le 


(from the results of Appendix 2). 
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It can be shown (e.g. Luce, 1963) that optimum b is then given by 


= I VIO=PE] 
VOB 
t giving 0<b < wo; otherwise b=0 or oo. i 
Now min E~i(log b)* for b large or small compared with unit, and ot 
fixed and sufficiently large or small P, optimum b, as given by eqn. (3), can 
made arbitrarily large or small. ‘This means that if in practice there is a নী Le 
the total cost that can be incurred, Optimum values of b are simply unattaina | ? 
for extreme values of P. In other words, for fixed 1, efficiency, as measured La 
how far the error rate departs from the minimum obtainable error rate, Monn 
less in the two-choice cas abilities of the two stimuli he 
i ata supporting this statement are provided by Gree 
(1960). 


(3) 


Provided this leads to a resul 


different. Experimental d 


min 


log pb 
FiGuRe 5. Cost 


sa function of Tesponse bias for model 4. 


as to handle sequenti | Model 4 is that 


5 & I) 
It is formulated in such a way 
‘natural ? level of 


he 
It could be assumed that de 
ade, 
i ) ac r the response 7 has been m: 
Tr 
handle eo dt original level Sradually over time: this sort of model won 
effects টু fs ! RD 
i i Xpectancy ° encountered in rea 
time studies (ee: Bertelson, 1961). Pp Yer 


Besides the requir ye t 
cement 2 eu fis tha 

costs derived from differe additivity, other critical feature of cost SE 
far, the description of he ne UTCes shall be commensurable. In the discussion 

ha h মী 0 te processes of decision has been simplified in such a way 
that " bE) 2 decision can be e # not too complicated function © 

যশ ত hSuch a preliminary investigation 

Cation to make OT studying commensurability 

that all the Consequences of a deci: Eaflctad), 80 | off matri* 

ected in a single pay- The 

Cost of making the decision. 
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cost of making a decision of a certain accuracy can then be balanced against the 
reward to be received for such accuracy—in certain cases it is conceivable that the 
reward for accuracy is so small that it is not worth entering a costly decision 
Process to improve accuracy or, at the other extreme, it is possible that accuracy 
is so important that it is never worth terminating a decision procedure while 


greater accuracy can still be attained. 


3.1. Cost of Decision vs. Improved Accuracy of Response for Models 1 and 2 
Consider a zone of uncertainty of length h, in which the criterion level lies, 
where the observer's calculating limitations do not allow him to determine, 
when the likelihood ratio falls in this region, whether it is above or below 
criterion. Outside this region, accurate determination is always possible. 
| If cis criterion, and fy(x) and fy(x) are the probability density functions for 
signal and noise respectively, then if there were no calculating limitations, 


MAYB= | fra. « 
¢ 

g ‘ Yes’ to signal when such calculating 

lity of responding ‘Yes’ to signal 

is distance g from the lower bound 


ed that, over repeated trials, the 
n the zone of un- 


If p(Y'|s) is the probability of respondin 
difficulties exist, and p(Y's, £8) is the probabi 
in the particular case when the criterion level 
of the zone of uncertainty, then if it is assum 
criterion level is randomly and uniformly distributed withi 
certainty, 
1 ph 
AYB= 5 J 20 ds 6) 
and | 
AYbe- fT fe0) dra ) “He (a) dx, (6) 
ct+h—-g e—g 


here «i i 
7 fe sue of “ Yes ’ responses made in the zone of uncertainty. 
EL ৰ | hat the observer has no information about the position of the 
one 0 uncertainty relative to the criterion level, then «a will be independent of 8 
Writing | 
ye 

Fi PE 
F)= fC fet) ds 
— %0 


and observing that 
Fy(c+2)=Fy(c) +2fy(0) +32 (0) 


by Taylor’s theorem, then eqn. (6) can be rewritten as: 


P(Y'hs, 8) = 1 —Fy(ct+h-g)+ alFy(e+h-8) + Fy(e—8)] 
=1-Fy(d)-(- lh —g)fy(0) + 30-8) (0) 


+ olgfy(e)— 38 (0) 
=p(Yh)- Ila) +g]fy0 3 


to second order in 2; 


1-0)-2hg(l- a) +g]fy (0). 


Applying eqn. (5) gives: 
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MY’) =p Ys) + Ih(2a — fy (c) —h2f;.'(c)/6. 


Similarly, EE) 
fi MY) =p Yn) + 3h2a 1 s(e) — 2,016. 
Suppose the pay-off matrix is as follows: 


¥ N 
$ 01 02 
n 021 022 


Pp fain in f l 
* id are used instead of 
Then the ex ected gain i reward (G) when MY ls, MAY In) are used i 
M(Y [s), A Yn) is: 


| 

| 

G=th2u- 1f,(c)P(o,, 012) hf, '(c)P(o,, ~012)/6 | 
ie 1 Plone Af 

hf y'(c)(l —PYlos, = 022) /0. 
| 


Now if criterion is optimum, i.e. if it maximizes expected reward, 


1963, [ 
(This is a standard result of si 
PP. 148-149.) ‘Therefo 


gnal detectability esult 


fe e 
theory, e.g. see Luce, 
re, terms in h cancel out i 


n eqn. (7), giving the f | 
G= THR) sf,(0)), ll | 


Where R, 5 are Positive constants deter 


nd 
the pay-off matrix, Note that in the s 


10 a 
ys NEN" OR ItI1CS 
mined by the a priori probabil f) 


> R= 
Ymmetrical unbiased case, where 
f(s) =(27)-12 exp (— 


iw — 1d’), 
Axa) =(2n)n exp (_ 4 1d) and c=0, | 
Rfy'(c) 7 Sfa'(c)= (21) Ry exp (—d2/8), rua 
Which is a Maximum at d= (by simple calculus) and 0 as d’>0 or 0. cannot | 
is, the effect of Inaccuracy is negligible for Poor discriminability (things ¢ ur 
get worse) and for high iscriminabilicy (t g | 
effect occurs for int im; 


The conclusions c ion are that 
cost of making a dec 


SO su ion 
; F Cy against the consequence tl 
decision again finds != iE 


Cc 

i vious 5€ nse 

Whereas in the pre trespoPt 
nimodal and bimodal cos e 


b its 
: “IS the point where inaccuracy has 1 
Uli and an Unbiased pay-off matrix. 


ion Point between u 
bias curves, in this Section 
effect for equiprobable stim 
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4. SOME APPLICATIONS 


b There are difficulties in attempting to apply some of the results that have 
cen derived for commensurability: the measurement of h, the width of the zone 
ome techniques: in a confidence 


of Uncertainty, would involve some cumbers 

rating experiment it would mean asking the subject how confident he was about 
his conlidence ratings, and in a reaction time experiment, the supposition that 
the subject goes on calculating until he reaches the limit of accuracy determined 
by h, would imply that h could be inferred from the upper limit of reaction times, 
when artifacts such as the subject falling asleep or otherwise failing to respond 
are allowed for: obviously this sort of procedure would seem arbitrary. For 
these reasons, no attempt is made in this paper to review experimentally the 
hypothesized relation between the accuracy parameter h of models 1 and 2 and 
Some discussion of the commensurability aspects of cost will 
aper on communication with artificial languages. 

the discussion is restricted to the demonstration 
12) can make relevant predictions in a number 


discriminability. 
be attempted in a later p 

In the examples that follow, 
that one of the cost models (mode 


of fields. 


‘The study of Carterette et al. (1965) relates, inter alia, 
e for different probabilities of a signal in 
f cost it is possible, at least 


(1) Reaction time 
and mean reaction tim 
on time as a measure 0 
sults quite neatly. 


discriminability 
noise. ‘Taking mean reacti 
qualitatively, to account for certain of their re 


Probability a 
R 


pls) 


Decision 
axis (5) 


(a) 
tions and mean reaction time for 


Ficure 6. Relation between probability density functio! n { ion ti 
« Yes ’ reaction time; ©, ‘ No’ reaction time. 


stimuli of high discriminability. O, 


st readily be represented graphically, Figs. 6 (a) 
d with a lax criterion A (high probability of 


Using model 2, since it can mo. 
B, and a cautious criterion C (low 


and 7 (a) show the cost associate 


saving * Yes ’), an unbiased criterion 
probability of saying ‘ Yes’). Tt is assumed that the same three values of B (the 
are used to determine the positions of A, B, C in both figures. 


likelihood ratio) 

"'hese positions h 

to predict the data. 
D 


ave been chosen a posteriori to fit the data rather than a priori 


Assuming such different criteria are adopted as the 
5.P. 
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ion time afd 
a priori signal probability changes, and mean cost and mean EE 
equivalent, Figs. 6 (b) and 7 (6) indicate the predicted variation if abil 
with a priori probability. The essential point is that for low Lee Steril 
(Fig. 7) the cost and reaction time for ‘ Yes’ is higher for the un high di 
(B) than for either of the biased criteria (4 or C); whereas In Tie Ce ted 
criminability case (Fig. 6) the unbiased criterion gives a reaction tin dl 
mediate between the reaction times for the other criteria. Carterette চা, 
(1965) report data for three subjects: for two of them the mean re! if 
Priori signal Probability curves are as in Fig. 6 (b), but for the other subje 
Curves are as in Fig. 


secrimination! 
7 (b): this Subject has also markedly lower discrimina 
This is mild Support for model 2. 


Probability 


Decision 


axis i 0 
Xl 
| (a) (b) fol 
a . ‘1 i i ত 
IGURE 7. Relation between Probability density functions and mean reaction যা 
Stimuli of low discriminability, O, ‘Yes’ Teaction time; @, ‘ No’ reaction time 
’ 2 i 
(2) Vigilance Broadbent 
term iti i 


. cast 

দৰ in this 2° 

Suppose the effect of fatigue in ঃ ct 

p ন ly altering the criteria. He can a, Ll 

Utious Criteria, ED E , ore cautious, js 
12, by making them m 


j নি 
Cautious criteria Me more cautious but 
This is Broadbent & Grego | 


s || 
cr Acting noise on the task. To 1 
ease Cost; (the attention mechanism works like 4 ণ i 
ত L ects attention to those Parts of th t where-most C the 
being incurred); incre an be ach, 'e system L making is 
criteria converge on X y making the a in Fig. 8 (b) by this aga 
Broadbent & Gregory’s finding. Cra closer together— 
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cel OES As a final example, the relation between response bias and 
NT # ity in the consonant systems of natural languages will be discussed. 
A oS h inguists (e.g. Jakobson, 1953; Chomsky, 1965; Halle, 1958) have 
Ton ট that each phoneme should be ‘represented as a collection of binary 
i: ctive features. Results for the consonant system are presented which are 

ased on one such feature analysis (Halle, 1958). While many linguists (in- 
cluding apparently Halle himself) would not consider this particular analysis the 
best in all its details (e.g. Halle, 1965, now uses in addition the feature gradual 
release for describing the consonants of English), the following argument is 
unlikely to be affected by minor adjustments in the details of this system. The 


Probability 


(a) 


ABXCDE Decision axis 


Cost (Model 2) 


(6) 


ABXCDE Criterion level 


FicuRE 8 
Cost derived from probability density functions in a vigilance task. 


o extra consonants added). 


Fis 58) sy' i 
alle (1958) system is shown in Table 3 (with tw 
m for the follow- 


Response bias and discriminability will be looked at in this syste 


ing six cases. 
and immediately preceding 


ng of words 
g in the Thorndike-Lorge 


1. Engli | 
long a কস ashen at the beginni 
a ather). S : all word ccurrin 
(1944) word count. ) ource: all words 0 
2. English consonants at the end of words 
Source: Thorndike-Lorge. 
3. English consonants at the beginning of w 
the liquids (l or 7). Source: Thorndike-Lorge. 


and immediately following long a. 


ords and immediately preceding 


UL 
[5] 
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4. French consonants at the beginning of words and imEdintey y wie: 
a (the vowel sound in the French words chat and rare). Source: 
Shorter French and English Dictionary (1940). 

5. German consonants at the beginning of words 
a (the vowel sound in the German words lamm an 
Compact German Dictionary (15th edition, 1958). 

6. Welsh consonants at th 
ing the mutations that occur i 


and immediately ET 
d lahm). Source: Cassell's 


go ye bi lg ‘orlect- 
e beginning of words (radical forms only, negle 
n the initial consonan 


of ই] 50 
ts of some words). Source 
The New Welsh Dictionary (1953). 
J THE PRESENT 
"TABLE 3. HaLLe’s ANALYSIS OF THE 20 CoNSONANTS* WHICH APPEAR IN THE PF 
SAMPLES 

P as in par kas in car 0 as in thy Jas in jar 

b as in bar E as in garden n as in name in sharp 

m as in mar tas in tar 5S as in sat 

f as in far 


2 as in pleasure 
1 as in sing 
X as in Bach 


d as in dark 


V as in vase 0 as in thigh 


Z as in zebra 
Cas in char 


PBPbmty k #tdAV nse €I কঠ DD 
Erave ফা কক HE = 5 iE FR = == চক লক লা অ কু সা 
diffuse EE EBERLE ESS el 
strident EEE ESE ES EEG BET LTE ES 
nasal SE NEES: ot Ete tL 3 Ne 124, 8 Ge Ge et: HE 5 
continuant PELE a5 Sf নত == = = = ৯ is Sa + - + 
voiced সু fe 2 Hh = f= = কু 3. £ = i নাগ 


features — Vocalic Fconsonantal. 


| Le are 
CCs are as follows. In general, we # 
Ant system in a single i 


ঠি ince it is clear 
€ special environment, since it 1S 


» the 


. i a 
ৰ j ginning of words and immediately P 
(1559 ue 7 Was adopted since it w: je] Sh mnt thar Miller & Nicely 

use i in 

( ) acoustic confusions betting 
m this decision, Man! ue 
Ord, as in case 2) or languag mp 
gh: Wn Bhout Welsh vowels to atte al FT 
Sh; instead, Consonants at the beginning © 
Howeve 


are 
tr, the rules Eovernin hese system: 

: L £ all these sy 

2 majority of the Possible conson 


:n eve 
ants can occur In 


consonants. 


+ Insufficien 


analysis for Ww 
Welsh words were studied. 


not Very restrictive—. 
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here the rules impose severe restrictions on 


case—and in contrast a system Ww 
initial English consonants immediately 


possible choices was studied (case Ee 
preceding liquids). 
In all these cases the technique of analy 
DUEMNTENGE of each consonant in the environment in question was determined: 
frequency of occurrence’ means number of words having this particular 
consonant in this particular environment, not number of occurrences of these 
words. This convention has been adopted since number of occurrences of 
Words is very hard to determine (even Thorndike & Lorge is not complete in 
this respect), whereas number of words can be readily determined from a 
dictionary. In casel,a check was performed on this procedure by showing that 
words beginning with a particular consonant did not interact with the frequency 
class to which they belonged, i.e. number of words and number of occurrences of 


words are equivalent measures of frequency. 

When a word count such as Thorndike-Lorge is not available, small 
» dictionaries have been used, on the grounds that these will contain 
t words of a language, and rare words, which would be found 
d might not be typical of the language as it is commonly 
Appendix 3 gives the results of this statistical pro- 


sis was the same. The frequency of 


‘ concise 
only the frequen 
in larger dictionaries an 
used, are thus omitted. 
cedure. 

An empi 
observing response bias. 
Diased than unbiased for 
biased than unbiased for 
poorly discriminable dimensions of the conso 
bias than would be expected by chance, but along 
sions there would be less than chance bias. FE 

This hypothesis has been tested in the following way. First, from a given 

eon a particular 


system, the number of consonants with a particular feature 
dimension has been counted (e.g. in case 1 there are 10 + diffuse consonants 
and five — diffuse consonants). Then the total number of occurrences of each 


of these types of consonant has been counted (in case 1 there are Ee 
Occurrences and 164 — diffuse occurrences). ndex is the 


The following 1 
calculated: 
Ratio of number of occurrences of consonants having 
rrences of consonants having t 


feature to number of occu 
i . ~ feature l ট 
onniiots mber of Consonants having a particular feature to 
ite feature 


Ratio of nu : ; 
r of consonants having the oppo 
are not less than 1; in this example 


s on these data can be achieved by 
it is less costly to be 
ly to be moderately 
dicted that along 
ould be more 
able dimen- 


rical test of the cost model 
Models 1, 2 and 3 all predict that 


low discriminability, but more cost 


high discriminability. Thus it is pre 
nant system there Ww 
highly discrimin 


a particular 
he opposite 


numbe 
nd denominator 


where both numerator & 
386/00 
bias ratio= TE - 
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. . r of 
Thus the bias ratio will be greater or less than unity PAE OY as i একলা 
occurrences of consonants is more or less biased than the num RT Sf Conc 
i.e. the bias ratio will indicate whether the bias of el if the 
actually observed is more or less than the bias that would be 0 EE 
frequencies of occurrence were assigned to the possible SOTSONID Ee Taney tok 
‘Table 4 shows the results of this proceduie. There is a ble a IE 
consonants with high random bias to have lower bias ratios than those wi 


random bias (Mann-Whitney U-test on high random bias (> 2) and low random 
bias (<2) groups, 2 per cent level, tw 
Wallis one-way analyses of v 
significant results are obtained (H= 


< 0-001 for high random bias group). fis 
tio, corrected for the difference between GR 
Ws: grave, 1-70; diffuse, 1-325; strident, 2° ? 
Voiced, 1-02. These values are computed 
as group and the low 
ach dimension how much on average 
its bias ratios exceed t 2: ach belongs to, and finally Addn 
i both random bias groups Les 
TY procedure since the different 
ers of bias ratios from the various 
er, it probably represents a fair first approximation. টি 
inability of the phonologie 
r frequency response 200-6500 Cpa 
ial about these Physical parameter 
Cest sound spectrum Miller & Nicely used, with 
i noise level, enough to produce confusions, but no: 
Excessively so), x, the choice theory measure of discriminability, takes bs 
: grave, 2.33; diffuse, 2.86; strident, 2.56; nasal, 31:3; con 


. . . . 4 1 i ed 
্‌ 3 Le. descending order of discriminability is voice 
nasal, continuant diffuse, strident, grave. 


. R of 
Correlation between Order of bias ratio and order 
The rank Orders of the 


. . 1 A 
bias ratios from cach of the six samples 
Corrected in the manner described above, Agree well with each other (Kende 
1) and the rank Order of the bias ratios from each wa 
ly with the order of discriminability derived from Miller tly 
mean Kendall Correlation for the six samples = 0.44, significan 
Zero (t-test, P<0.001 b n 
nt hypothesis would predi St perfect correlation betwee 
Sures: failure to achieve thi 


sine 
: Perfect correlation could be explain 
by the unexpectedly low bias ratios of t 
grave and diffuse. Now 


1 of the bias ratios of 
together. This is not an e 


random bias 
dimensions. Howey 
This result can 


discrim 
features. Using Miller & Nice 


ly’s data, fo 
is nothing spec 


correlates positive 
Nicely (1955), 
different from 
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TABLE 4. Bias RarTios 


Ratio of number of 


Ratio of number occurrences of 
of consonants: consonants: 
‘ random bias ° ‘ actual bias * Bias ratio 
Case 1 
Erave 432:118 3:20 
diffuse 386:164 1-18 
strident d 110:440 2:67 
nasal 13 104:446 0-66 
continuant :10 83:467 2:81 
voiced ৰু 235:315 1-17 
CasE 2 
grave 79:112 0:85 
diffuse 142:49 1:32 
strident 60:131 2:18 
nasal 27:164 0:87 
continuant 53:148 27 
voiced 85:106 1°25 
CaAsE 3 
grave 1622 :474 2:15 
diffuse 1501:595 1:51 
strident 372:1724 2-11 
continuant 410:1686 2.47 
voiced 638:1458 1-03 
CasE 4 
Brave 2288:798 2-87 
diffuse 1 2195:891 0-99 
strident 750:2336 2:34 
nasal 613:2473 0-67 
continuant 750:2336 2-34 
Voiced 1340:1746 0:98 
CAsE 5 
Brave 421:259 1-16 
diffuse 511:169 1:01 
strident 176:504 1-43 
nasal 167:513 0-61 
continuant 176:504 1-43 
Voiced 419:261 1-15 
CasE 6 
Erave 4168:2149 1-45 
diffuse 3812:2505 0-61 
Strident 666:5651 3:39 
nasal 603:5714 1:57 
continuant 769:5548 4-01 
voiced 2818:3499 1-24 
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cost treatment. (Thereis an old study of Sacia & Beck, 1926, reported by eee 
1951, where energy involved in articulating a consonant and its NOTE S 
occurrence are negatively correlated.) lItis possible that these latter considera 
tions will have some influence on the di ! ন 
diffuse dimensions, being most associated with the largest physical changes it 
the articulation process, might be most affected. | Ba 
However, a case has been made out for at least some contribution of ia 
criminability and cost to the distribution of consonants in a number of as 
environments and in a number of different languages. It is suggested that ol, 
factors have shaped the statistical Structure of these languages over the course Se 
time; i.e. the interaction between discriminability and cost has led speakers 9 
the language to adjust their response biases for words of different phonetic 


Structures, and this ad the frequency of usage of differently 


i i i + crave and 
Stribution of features, and the grave an 


Justment is reflected in 
Structured words. 


5. SUMMARY 


In restricting the discussion 
of signal detectability theory in very 
ention has been to lay a foundation on which ultimately 
ilt having sufficient elaboration to satisfy, for example, 
Control theorist. However, some successes have been 
i fice of this paper: (1) the device gy 
taking some of the proce g detectability thevne Tires (namely, 
00d ratio and determine whether i 
iterion level) bei 
ghtly Unexpected interactions between discriminability 
larity between models 2 and 3 means that signa 
Oncepts can be used interchangeably in this sort 
12 in its analysis of sue 
and linguistic studies. ‘This, oe 
: by definition, it must be widely 


has produced models tl 


topics as reaction time, Vigilance 
course, is the value of the co 


Oncept 
applicable. Cpt of cost 


i fford 
» and partly while he was Con 
he Queen’s College, Oxford. ‘The “ation 
Audley for help at many stages in the prepara 
n Sectio 


. H [9) 
1 3 was improved by the suggestions 


wishes to thank Professor RT. 
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APPENDIX 1 | 
Central Dip in the Cost Function for Large d' 
», the sum of the probability density functions of two eqiprobable normally 
distributed stimuli is proportional to exp (—1x°)+exp [—1(d’—+)*]. So that 


dyjdx oc —x exp (-1x2)-(x—d') exp [= 1(4' — x)°] | 
and 
dyldx oc —exp (1x2) + xt exp (— 1x2) exp [402 3) 
+(x-—d') exp [-1(d' —- x). 
At the central criterion level, +» = ১d’, 
dyldx=0 and dyjdx®ccexp (— ida? 2), p 


which is 2 0asd'Z22. Therefore x= id’isa maximum if d' <2 and a minimum 
if d’>2, i.e. the central dip in the cost function appears only for d'>2. Q.E.D. 


APPENDIX 2 


Derivation of Choice Theory Results in the Case of Two Equiprobable Stimuli | 
Suppose 1(s, S)=1(s', $’) and write 


1s, s) =< b(r) 
Then 1(s, $’) 7 and 5) =. 
= AUR 
rhs pe, 
EE 
MR > 0 
nb 
(rs) = Ee 
Mr'k)= 
0৮), 
Mean cost=E bs) =p) =. 
7 7220) 2200 log pr) 


b 
i Grit 
t £ (16+1)-1 1 
TBF )—log 16] 4 7571 [oe 5-11} 


b 
“toe +2)! ” 
+b 7+) 0g b] + a+ Blog (1+ 0) log on} 
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১ 
“ 2 1+276+62 
slg PDE TR UY [Ee 1)(7 +6) 
blo ’ 1+2176+17* 
8° | GE 1+) 


2(1b + 1)(n লা” b)E= (16+ 1(n + b)[log (nb +1)+ log (17+ b)] 
—1 log n[1+ 216+] —-b log b[1 + 276+ 1°]. 
Differentiating with respect to b, 
aE 
2[1+2nb+ nJE +2016 +1)(1 +6) Ti 


=[1+270+ 70g (16+1)+log (7+ )] 
+(17+0)n+(16+1)-27 log nln +6] 
—b log b[27]— [1+ log b[1 + 276+ 1°. 


Differentiating with respect to b again, putting b=1 and using the fact that 
db Jdb =0, 
1 


aE 
4nE 4204 US =| 
nE+2(n +1) = শশ[;ল দদা 


+212 log (1+1)+1]-2n7 log 7—4n1-—- [+n]. 
When b =1, 


2 log ”. 


2E= l= 
E=2 log (1+1) লক 


- Thus 


i 21 14+ lo 
| (1+) =" [208 (141) log n—1-2 log (1+ D+ IE 


| 1 y | 
= log n1- 1 
% E ৰ £m 
Therefore Eis a maximum or minimum at b 
Or positive, which is according as 


_1 according as d®E|/db? is negative 


= < 
n 1 1. 
lb ৰ) 0g n> 
The Solution to 
। 
1 jogn=1 
n+l Ll 
1S 1 =4.681, 
Eisa পি পল at b=1 according as 75 4681. Q.5.D, 
minimum 
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"FHE USE OF A NULL-POIN'T METHOD TO STUDY THE 


ACQUISITION OF SIMPLE AND COMPLEX TRANSFORMATION 


SKILLS 


By GoRrboN PAsk and B. N. Lewis! 


System Research Ltd., Richmond, Surrey 


A null-point experimental method for measuring human performance is 
‘Phis method can be used in studies of learning and in other situations 


described. 
nt to the experimenter's 


where the cffect of learning is appreciable, though irreleval 
objective. It provides a form of constancy of experimental conditions which is 
appropriate if the subject is conceived as an imperfect goal-directed system. 
‘he method is exhibited in connexion with two experiments and a subsidiary 
study. One experiment involves the acquisition of a skill in which the human 
subject learns to apply a single transformation rule. For the other main experi- 
the subject learns to deal alternately with two rules (so chosen that 
rell as positive transfer of training takes place). Further, in this 
case, the subject learns a concept which (in an operational sense) comprehends 
the pair of rules. ‘The subsidiary experiment involves a modification of the 
original technique which may be useful in more detailed investigations of concept 


ment, 
interference as Ww 


learning. 


1. INTRODUCTION 

a null-point experimental method and its application in 
a series of human learning experiments. The method is shown diagrammatically 
in Fig. 1. ‘The subject is learning a skill that entails the solution of a sequence 
of problems (in the cases considered, these are coding problems, but the method 
is quite general). ‘The control mechanism, M, instructs the subject in the skill 
by adjusting the problem difficulty as a function of his behaviour, so that the 
Subject’s performance is held at a fixed level, the system null-point. 


‘This paper describes 


Ficuns 1. Null-point or steady-state system. M changes the difficulty to compensate 
by 1 id [> «7 

for learning in the subject. ‘The coupled system ‘subject, M ’ does not learn. 

S, subject; M, control mechanism; D, display; R, response; PS, sequence of 


Problems typical of the skill. 


1 Now with the University of London Institute of Education. 
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g ‘vices 
M’s of this type have been extensively used as hi an 
(Pask, 1958 a, b, 1960, 1961 a, b, 1963 a, b, 1964; Lewis & Pas 1 adjustment 
in this capacity M may be designed to mediate a more ET ন! 
procedure; however, for a certain class of Skills, the mAnte ফ! 2 
suitable null-point is also quite a good teaching strategy. pple 
purpose, it is not necessary to examine the teaching function 0 al J ile 
The operation of M is, of course, constrained in various Way ঠু sitfian)e 
first place, M must not overload the subject by presenting problems ud role 
that they are unintelligible. On the other hand, M has to provide su a 
relevant novel experience to sustain the subject's attention. It has been arg i 
in other papers (Pask, 1961 a, b, 1966) that M has to satisfy a number of m 
subtle requirements as well as these. নট 
The experimental advantage that is gained by using an M stabilized ay 
is a standard condition for measurement. ‘The progress of learning can, is 
example, be expressed in terms of the difficulty of problem that the subject i 
able to solve at a constant level of proficiency. The ‘ constant BORE 
measurement is often useful, as it stands, in ergonomics and design studies 
(Hudson, 1963; Kelly, 1966); it is a potent type of measurement when ‘ difficulty 
(or its converse variate simplification ’) can be interpreted within the framewor 


of a problem-solving or learning model (Pask, 1964, 1966), whereas the detailed 
response mechanism cannot be represented in this fashion. Ro 

Most importantly, if we entertain the cybernetic concept that man 18 
goal-directed system, the 


্‌ iate 
n a technique of this sort Provides the apPrope 
condition for measuring his performance. For, if man is a goal-directed syste! 


at all, he is usually an imperfect one. His imperfections may be due to inatten” 
tion or fati toire of mental processes. In any ie 
ন) The null-point method offers the BOs 
bility of studyi itions that ensure constant efficiency in go 
If manis a goal-di d system, then it is this constancy, I 
2 ions, which ought to be preserved fror 
experiment to experiment. 

Several other standard conditions 
methods (Verhave, 1963 61; Pask, 1963 a, b). Tn each case, i 

j (by definition) has nor 
te vidual measurement, by the examinatiC, 
of a coupled ‘ subject, M° i The coupled ‘ subject, 
itis a stable dynamic system an 
measured without difficulty. tiCC 
b d in the place of M. In practic t 

The various M’s used in the pres 


rom his response board. learni 
acquisition of a simple code-le the 


skill. In Experiment 2, A more complex situation where 


M controlled 


« 
| 


A Null-Point Method for Studying Transformation Skill Acquisition 63 


subject Was required to deal with the alternation of two different codes: on any 
trial one code is named and used, but the subject is unable to predict which code 
this will be beforehand. ‘I'he latter situation makes the subject divide his 
attention between the two codes, or the cognitive processes responsible for 
dealing with them, and the rapid and unpredictable alternation gives rise to a 
great deal of interference between the sub-skills of applying the separate codes 
even though, owing to code similarity, there is some positive transfer of training. 
Hence, it is possible to study the development of a conceptual structure that 
enables the subject to treat the complex skill as a whole and thus to overcome the 
effects of interference. 

Finally, in both Experiments 1 and 2, it was possible to observe an interesting 
mode of man—machine interaction (called ‘ participant interaction ’ because it 
is symptomatic of an attempt, on the subject’s part, to control the strategy of 11). 
Briefly, the stimulus response communication between the subject and M can be 
viewed as discourse in a zero-level object language (designated L°). The 
vocabulary of L° consists in signs for stimuli and responses and its syntax is a 
set of rules for constructing legal responses. However, L® contains no legal 
expressions that M can interpret as instructions to change its strategy or the form 
of stimuli it presents. All the same, a subject can learn about the control 
principles built into M and he can misuse L® in order to modify the experimental 
situation. He can, for example, make responses that M will interpret as mistakes 
(even though the subject is able to solve the problem posed by the stimulus and 
is not really mistaken when he makes such a response). As a result of this 
behaviour, M reacts as though the subject had actually been mistaken (by reducing 
the difficulty of the task, or, in a code alternation skill, in a more elaborate 
fashion). 

Various instances of ‘ participant interaction ’ have been observed in the 
conduct of the experiments. Some attempts have been made to investigate the 
phenomenon and to provide higher levels of communication (object languages, 
L1) that bring this mode of discourse into the open. 


2. EXPERIMENT 1 
nt 1 was to realize a ‘subject, M ’ system in the 
specific milieu of a code application or transformation skill (without code 
alternation) and to investigate the stability of different ‘ subject, M ° systems. 
‘The transformation skill has the following characteristics. A subject is 
presented with an unlearnable sequence of stimuli spaced equally in time taking 
the form of configurations of up to four illuminated signal lamps (selected from a 
group of eight lamps). Before the experiment begins, he is told that a trans- 
formation rule, (2, relates the group of signal lamps to a row of eight response 
buttons; he knows this rule and is able to recite it. Given a stimulus (designated 
x) he is required to solve the problem of producing a response, y= O(x), within 
a deliberately restricted interval, 6 (his response is complex, for he must press 
several buttons to denote the transformation of a stimulus configuration). 


The object of Experime 
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Re es inter-trial 

The size of the stimulus configuration, the Value ial বৰলৰ" Shin 
interval are chosen so that (1) a fully proficient subject ET Lr context of $2 
a ely, but (2) a novice finds the problems posed by x in the contex a 
Gln (e.g. his uncertainty regarding the value of vy when x 15 gIven 183 

i ssly overloaded). K লা 

that চা a conditions a ie, can neither perform the skill nor eee” 
perform it unless he is provided with external assistance. i a SE bl 
of external assistance is provided by M, then the subject 1s LOGEC i বলপশা 
various problem-solving behaviours which would, in more relaxed circumsts 
be unobservable. 


In order to start a learning process, M must reduce the difliculty Ea 
situation. Tt does so by ‘simplifying’ or ‘partially solving the RED ন 
posed by the stimuli. A combination of two simplifying procedures 1s use 
this purpose. fl , 8 

One procedure consists in reducing the number of lamps in a eer 
configuration. The other consists in providing cue information that Le 
specifies the correct response. Both of these procedures, as well as I 
combination (to form a conjoint simplifying operation), have been valida যি 
(1) with respect to a computer-programmed learning model in which they ser i 
to solve partially the problems posed by stimuli, s, and (2) by checking, edi 
cally, that the percentage of correct responses elicited by 
simplified stimuli is never less than the percent 
by a sequence of less simple stimuli (and that 


Proficient subject, it is greater). ‘The snare to be avoided is that some intuitively 
acceptable simplifying procedures suffer from an ‘inversion ’ noticed by van or 
Veldt (cited in Woodworth, 10950). Ifa subject is taught to respond to a complex 
stimulus by way of simple exemplars, the subsequent presentation of a oan 
complex stimulus may give rise to more, rather than less difficulty; the subjec 
has acquiied a perceptual motor organization applicable to the complex situation 
the organization apposite to less complex situations has decayed. 


M com 


a sequence of গণ 
age of correct responses a ly 
টী fa fu 
» If we exclude the case of a fv 


During the learning process, 
rate (called ¢ Proficiency ’ and designated p 


of simplification (designated p4) so that the subject's Proficiency is held et 
(p tends to a fixed value po). In the terminal condition, when the subject 
learned the skill, the 


ra al nt this 
ত k of simplification is reduced to zero. After t 
point, further learning leads t 


aan ONSe 
Putes an average correct ce HE 
). It continually adjusts the deg 


y ক Zed in the predicted learning cur 
of Fig. 2. For graphical convenience, the “ degree of simplification ", I 
been transformed to an index of ‘ objective difficulty’ 1 =n — pF Le fixes 
ndition, in which M is seeki hat 

P at po for the naive subject. Phase I represents the controlled part © WS 
learning process where 1 is increased to maintain p c ae 
the uncontrolled increase in p fo 


onstant. Phase I Ji- 
let 2 Tr 7 held at its maximum value (degree of Si. 
fication is zero). Finally, Pha: 


““‘থ VE 
Se IV is an unpredictable part of the cur 


EY cE 


fl 
২১ 
DD, 
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Which, according to his motivation, the subject may either stick at the job or 
succumb to boredom or fatigue. 

The chosen value of po influences the subjective situation of the learner 
and may be expected to modify the objective efficiency of his learning. If po 
is low in value, the subjective situation is ‘stringent ’ in the sense that a small 
increase in proficiency leads to an increase in the difficulty of the task. Con- 
versely, if po is high, the situation is relaxed. Hence, for low values of po the 
subject has to deal with more difficult problems and his uncertainty (regarding 
its solution) is greater, on average, than it would be for a high value of po- 


nN — >» number of trials spaced by At 
FIGURE 2. Predicted form of learning curve in ‘subject, M? system. -----; » Ps 


, 7. Phase I: Initial transient; 7 increases; p approaches p=po. Phase I: 
Steady state; p—>po; 7 increases. Phase III: Uncontrolled increase in p with 
]=Mmax. Phase IV: Unpredictable part. 


Various predictions can be made about the effect of py upon objective indices 
of learning. In fact, we advanced the hypothesis that learning of this type relies 
upon an information processing mechanism (a functional entity, perhaps to be 
identified with ‘ intermediate ’ or “ constructive y memory) and that this mechan- 
ism works best when (1) it is maximally loaded with relevant data, providing 
that (2) it is not overloaded. ‘This hypothesis leads to the prediction that the 
learning rate will increase with decreasing po provided that po is not forced below 


a minimum value. 


Method . 

The different ‘ subject, M’ systems in Experiment 1 were characterized by 
simple and readily instrumented control rules. The control rules all had the 
effect of increasing 7 or decreasing t as the subject’s measured proficiency 
increased. ‘They differed in so far as different values were assigned to a para- 
meter, & (the increment in proficiency required to bring about a unit decrease in 
simplification). 

The level of proficiency, po, at which the subject is held (throughout most 
of the training process) depends upon the value of &; and different control rules 
may, equivalently, be characterized by different mean values of Po: The 
dependent variable in Experiment 1 was an index of the learning rate for groups 


E S.P. 
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i i ifferent resulting 
of subjects assigned to systems having different values of £, different 
values of po and different ‘ stringencies ’. 


Apparatus d in 
Display and Response Arrangements ‘The display and response board use 


a d, es 
Experiment 1 comprised a group, A, of eight signal lamps, lettered a, b, ¢ 


টু & idin 

f, £, h, which were used to deliver stimuli; a TOW, B, of eight ন পা এ of 
knowledge of results data and a modicum of cueing information; a i D' of 
eight lamps that form part of an external ‘ memory ’ facility and SE B, 
response buttons, numbered, 1, 2, 3, 4, 5,6,7,8. ‘The members 0. disfande 
and D were in one-to-one correspondence and separated by a vertical daete 
of one inch. ©The lamps in group A were arranged in two rows of four an 
situated 11 inches above row B. 
Stimulus Presentation In t 
proficient) the stimuli, 
illuminated lamps, 
for an interval of § 

‘The apparatu 
Which are the foll 


= ed 
CV) = cedf, (V)=egdf, (VI) = fheg, (VID) =gafi and (VII) =gahb. Simplifie 
stimuli are Configurations of m lam 


2 মনল ject is 
he unsimplified case (obtaining when the Sy 
%, consisted of configurations of four Bion AO Hl 
‘The stimulus lamps remaine 


Were selected b 
ment 1, At=5.5 
al frequency in t 
Onstraint, the se 


at 
Y a quasi-‘ random ’ Apparat iy শি 
Sec.). Each of the eight Po 
he stimulus sequence over a blo 


: ৰ >d to 
24 trials; apart from this c quence is arbitrary and is intended 


be unlearnable. 


. হ্‌ ion 
Response System A mapping, 2 (experimentally interpreted as a transformat 
rule), relates the elements denoted by the lamps in row B or (since the rot i 
HMOnE-to-one COrrespondence) to the lamps in row C or the response button” ,, 


row D. In Xperiment 1, Q was the stimulus response permutation @ 
b, Ls C, 6; d, 5; e,4;7, 3;g, 8; h, zl 

The Subject responded by selecting a Configuration, y, of response bu! the 
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However, the response buttons need not be pressed simultaneously and the 
subject is permitted to perform the construction of y in any order he pleases. 
To emphasize these conditions and to avoid an unwanted load on his immediate 
memory, each response button was associated with a ‘lock’ relay; this relay 
is energized if the corresponding button is pressed and it remains energized 
until 0-5 sec. after the end of St. Whilst the relay is energized, the corresponding 
external memory lamp in row C is illuminated to provide a visual record of y. 
The interval St is terminated by the disappearance of the stimulus lamps and 
the coincident illumination of those ‘knowledge of results’ lamps in row B 
that indicate the correct response W(x). This presentation persists for 0-5 sec. 
after the end of St (so that it coincides with the image of y, thus allowing the 
subject to compare ©2(x) and his actual response, Y). Coincidentally, the subject 
receives an auditory and visual signal (buzzer and lamp) to indicate whether his 
response is completely correct (all response selections correct) or mistaken. At 
an instant, 6t+0-5 sec. after the appearance of the stimulus, the ‘lock ’ relays 
are de-energized and the lamps in rows B and C extinguished. ‘There is a rest 
interval of 1 sec. and the next stimulus is presented At=5-5 sec. = 6t + 0°5 + 1 sec. 
after the last. 


Simplification Procedures M, schematized in Fig. 3, includes equipment for 
mediating two simplification procedures. One of these involves a variation 
in m (the number of lamps in a stimulus configuration) and the other involves the 
provision of cue information if, and only if, a binary ‘ cue variable’, 2, is equal 
to 1. 

The first procedure relies upon the mechanical embodiment of the following 
hierarchical array of subsets of the unsimplified (or m=4) stimulus configurations 


(which form the upper row in the hierarchy). 


m I HI II IV Vv VI VII VIII 
4 abch abdc bdce cedf egdf fheg gafh gahb 
3 abc bdc dce def efg Jeh hga ahb 
2 ac bd ce df eg fh ga hb 
1 a b c ad e fi [4 h 


At any trial, the equipment receives an instruction (1) to select a basic 
stimulus configuration out of the eight possibilities (labelled here with Roman 
numerals) and (2) to select this stimulus with a given value of m. Part (1) of 
the instruction determines a column in the array and part (2) determines a row. 
‘The element at the row-column intersection is displayed in lamp group A. 

The other simplifying procedure consists in the provision of cue information 
through the row B lamps, within the interval, St, in which a response can be 
submitted and accepted. Cue information partially specifies a correct respon: 
y=0(x). As above, it is provided if a variable 2=1 and is not provided tt 


2=0. Specifically, if 2=1 and if x is an m-lamp configuration, then after the 
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Control Rule As shown in Fig. 2, M includes a decision rule or control rule 
which accepts the current value of p(n) and determines the value of u(n) or its 
converse variate n(n). For Experiment 1, M embodies the control rule Fo 
shown below: 


An(n)= +1 if p(mn)>t£ 
unless n(n)=tmax=8 When An1(n)= 0; 


An(n)=-1 if &> p(n) 
unless nN(n)=mmin=1l when An(n)=0; 
An(n)=0 if E€>p0n)> Er. 
Initial condition; that 1(0)=1; &€*=£-—0- 15. 


The value of £ (the parameter varied in the experiment) determines the 
stringency of the situation, since it specifies the increase in p required to produce 
a unit increment in 1. 


Subjects The 30 subjects used in this experiment were young men and women from 
various backgrounds (about half of them being students and half being either Mensa 
members or local people). They were all familiar with mechanized situations, since they 
had acted as subjects in this laboratory, for other experiments. 


Procedure As a preliminary treatment, each subject was introduced to the apparatus and 
the skill and he was rehearsed on an irrelevant transformation rule. After this, but before 
the main experiment, each subject was required to memorize Q and to recapitulate it: 
(1) by drawing the mapping without any mistakes; and (2) by pairing lettered stimulus 
elements with numbered response elements. 

As a result of the preliminary treatment, the subjects knew what to expect from M, 
but the character of the control mechanism was briefly discussed and any questions about 
it were answered. 

Finally, the subject returned to the apparatus and was told that he must learn to apply 
Q successfully to unsimplified stimuli, e.g. in a terminal condition of = max = 8. 

Groups of 10 subjects each were assigned to three parametrically different control 
rules: £=0°55, 0°65 and 0:75. (These values of £ had been found workable in pilot experi- 
ments and they yielded po values in the regions 0-5—0°6, 0°6-0-7 and 0-7-0°8, respectively.) 
The value of p and 7 was recorded for each subject at each trial in the experiment to which 
he was assigned. 

In order to compare learning in each condition, it is necessary to define a terminal 
criterion of proficiency. ‘This criterion was taken as: 


N(mM)=Tmax=8 and p(n)> 0-75 


for 2r=16 consecutive trials. The number of trials required to reach this criterion is 
designated T and T values were recorded for each subject. Since At is constant, Tis also 
an index of the time required to reach a criterial level of performance. ‘To eliminate the 
possibility of a subsequent regression in performance, the experiments were run until the 
subject achieved a criterion of p(n) > 0-8, again for 27 consecutive trials (in fact, no regression 


occurred). At this point the subject was told that his performance was satisfactory and he 


was dismissed. 
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‘The value of T slightly overestimates the number of trials (or the time) taken to 
through phase I and phase I] of the learning curve shown in Fig. 2. Since there is Dit 2 y 
rapid increase in p (from its steady-state value of Po) once that 7 consistently equals max: 
the estimate is not widely different in form for the different values of Po- 


Results k 


The behaviour of all the subjects (in each of the three conditions) con- 
formed to the paradigm of Fig. 2. A typical learning curve (man-machine 
system trajectory) is shown in Fig. 4. Itis fora subject who was assigned to the 
£=0-55 condition and who attained a Po Value of between 0°5 and 0-6. in Ua 
region that corresponds to phase II of Fig. 2 these trajectories are stable in 50 
far as perturbations in p are compensated by variations in 7. 


|| 
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Fg 
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চু || { 
1 
48 72 
g 96 2 
$ 120 144 6s it 19 
FiGuURE 4. Learning curve for single subject. £=0.55; T=181. 
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Applying Jonckheere’s (1954) trend test to the original data, S/o=47 and the 
null hypothesis is rejected in favour of the predicted trend Tg>Tp>T, 
(P < 0-001). 

It thus appears that T is least (and consequently that the learning rate is 
greatest) when & is least (and p, is least). 


Discussion 


‘Taken alone, this result indicates that learning is enhanced when the sub- 
ject works under high information loading conditions (when he must attend to 
and deal with problems that are relatively difficult at a given stage in the learning 
process). Combined with the observations from the pilot experiment, the 
result suggests that learning is most effective when the subject is maximally 
loaded but not overloaded. fj 

‘This result is consistent with the hypothesis previously advanced. Itis also 
possible, but we believe.mistaken, to interpret it as evidence that a subject 
learns most rapidly in conditions that force him to. make a large (but less than 
critical) number of errors. The fact is that the subject is not really penalized 
for making an error; many of the events that M counts as ‘incorrect ’ are ‘ omis- 
sions’; although M dichotomizes events into correct and incorrect responses, 
the subject observes a ‘ knowledge of results ’ signal containing several possible 
degrees of correctness. On the other hand, the data do suggest that in these 
eo the delivery of knowledge of results is far from a simple reinforcing 
act an there 1s plenty of evidence that fairly elaborate features of the situation 
(such as an increase in 7) serve as higher order reinforcers. 


Observation of Participant Interaction If a subject accepts the experimental 
instructions he tacitly agrees to a certain legal interpretation of the events in the 
experimental situation. He agrees, for example, that stimuli are interpreted in 
the context of © as posing problems and he agrees to construct responses that 
so far as possible, denote 2 satisfying solutions to these problems. This body 
of agreement may be summarized by an assertion that the subject uses an object 
language L° that is concerned with posing and solving problems. 

The misuse or illegal use of L° in ‘participant interaction’ was briefly 
considered earlier in the paper. It is revealed by the occurrence of repetitive 
and often idiosyncratic responses that count as mistakes to M but that are not, 
in a functional sense, mistaken. In retrospect, a subject who behaves in this 
fashion agrees that he was playing with M; that he responded in a manner 
designed to make M alter the value of % rather than responding to denote the 
solution of a problem posed by the stimulus. Further, the idiosyncratic response 
behaviour is correlated with changes in % (a particular response pattern termin- 
ates, for example, when 7 decreases in value). 

It is difficult to quantify ‘ participant interaction ’ in the present situation 
(though some experiments intended to achieve quantification are in progress). 
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« sub-skill ". 
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Experiment 2 was concerned with the instruction of complex SBS firs 
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Experiment 2(5) WW see 
In order to compare the acquisition of a code দ অত == ত into 
conditions, A and B (described below), the stimulus যর Before the beginning 
six trial blocks separated by rest intervals of 5 0) 0 vas £0 HE applied, and 
of a block, the subject was informed whether ©, or t (at the beginning of the 
this instruction persisted until the next OEE unt: of Jnterbsence 
next block). Although this arrangement reduces t if the transformation rule 
between sub-skills 1 and 2 below the level assumed Ht Le for the subject to 
is liable to be changed at any trial, it still seems eV) FoNers a concept that 
achieve an adequate measure of proficiency unless e tole 
overcomes interference and allows him to regard the job as a 


i#(n)=itn) 


FIGURE 5. Control system used in Condition A for sub-skill alternation. Key asin Fig. 3. 
থে 6, computation for alternation rule F1, 


‘The two conditions, which are Juxtaposed in the experiment, differ in the 
design of M and the type of communication permitted between the subject and 
M. In Condition A, Mis designed according to an hypothesis that rapid and 
alternating rehearsal of the sub-skills is a prerequisite for the attainment of an 
overall conceptual organization. Specifically, M detects the subject’s proficiency 
in respect to each sub-skill and rehearses more frequently the sub-skill at which 
the subject is least proficient. ‘This alternation strategy is embodied in a higher 
level control rule, Fi, ‘There is, of course, an even less elaborate alternation 
rule than Ft; namely, the equiprobable rehearsal of the sub-skills for all values 
of n so that (if Tri(n) is the frequency of rehearsal of sub-skill 7 at the nth trial) 
Ti(n)= i=}. This condition has not been included in the present experi- 
mental design because it is known, ‘from previous work (Pask, 1964) that 
Ti(n)= i=} yields a slightly but significantly lower rate of -! i 
ignificantly higher value of T) than Fi. Ina study that দা a ee eds ন 
skills (limited at m= 3, however, rather than m=4) the trend 0 h! Sr 307 V 
a group of 20 subjects learning by F! and a group of 20 subjects learning by 
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difficulty, 7,, 72) by the sub-systems M°,, of M. However, the same control 
rule, F, is used for each sub-system, M°;, and py is chosen, on the basis of 
Experiment 1, to minimize the expected value of T. Hence, po approaches pin 
(this choice involves an Obviously inaccurate assumption of quasi-independence 
between the sub-skills). 

Next, the sub-skills are alternated. For Condition A this alternation is 
effected by a higher level control rule, Fi, that is built into Mt of M in Fig. 5, 
and which has the form: 

(n+ 1)=not in) if 7(n)=5 
=1 if 5>r7(n) and 72(n)> n(n) 
=2 if 5>7(n) and 7i(n)> 72(n) 
=1 with probability of 4 or 
2 with probability of 3 
if 5>7(n) and ni(n)= n(n). 

Here i(n) names the sub-skill to be rehearsed throughout the nth block of 
trials; 7,(n) and 14(n) are degrees of difficulty associated with sub-skills 1 and 2 
by the quasi-independent application of Fo; finally, 7(n) is the number of con- 
secutive previous occasions upon which the value of the index i has not been 
changed. The restrictions entailing 7(n) ensure that sub-skills 1 and 2 are 
sampled regardless of the state of the coupled ‘ subject, M ’ system. 

Control rule F1is a member of a class of control rules intended to maximize 
the product of variables Such as the 7i(2). These rules lead to the most frequent 
rehearsal of the sub-skill at Which the subject is least proficient and if the subject 
5 Ef Skill, they yield convergence to a terminal condition in Which, at 
trial n= 1, 

NAT)=nAT)=m  PA(T)=pA(T)> Po AT)= sA(T)=3 
(where (nm) is the frequency with which the ith sub-skill is rehearsed). ‘The 


terminal criterion is specified by adding the stipulation that PUT)>0-75 and 
PA(T)> 0-75 to the terminal condition cited above. 


Condition B allows the subject, rather than HM, to exercise control over 
the alternation of the sub-skills. ‘The subject was told that he must ultimately 
satisfy the terminal criterion and he was told, also, that he must rehearse each 
sub-skill for at least one block of trials in each group of five blocks of trials; 
the latter instruction constrains his activity in exactly the same Way as the r- 
dependent part of the M control rule Fi. He was provided with the values of 
n(n) and 7(n) through the indicators G and H and was further instructed to 
press either I or J, but not both, in each rest interval when lamp E is illuminated 
(thus selecting one sub-skill or the other for rehearsal throughout the subsequent 
trial block). ‘To avoid procedural mistakes or disobedience, a buzzer sounded 
if neither I nor J was pressed whilst E was illuminated or if both I and J were 
Pressed. ‘This buzzer also sounded if the subject tried to select the same sub- 
Skill on more than four consecutive trial blocks (in fact, if such a selection had 
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been made, it would have been rejected automatically by the system; however, 
none of the subjects did persist in disobeying the sampling rule). ‘These 
arrangements are schematized in Fig. 6 wherein the communication channel 
involving G, H, I and J should be regarded as the medium for a higher level of 
discourse, L1. 
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be Results 


‘Typical behaviour in Condition A is shown in Fig. 7. One salient SE 
“between Conditions A and B is that Tis less (learning rate is EE Ee 
A than it is in Condition B and the generality of this result is exhi ণ Fae 
following data, comparing the performance of the 10 subject groups ru 
Conditions. The T means are in terms of six trial blocks. 


Mean of T  S.D. Median 
‘ 76 
Condition A (N=10) 76-2 13°14 
Condition B (N=10) 1164 20-81 109 et 
i i ' T, and Ti; were found to 
Using the Mann-Whitney test, the values of T, 2B. 4 
be SiSnificently different (U=3; P<0.001). Hence, Hypothesis (1) (learning 
rate) is supported by the data. 
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FiGURE 7. Between trial 24 (block 4) and trial 408 (block 68) 0.65 > p> 0-5. 
র্‌ line; 72=light line; in) =1, black; in) =2, White; 
M1 and 72; €= 0-55. 
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H ion A the subject received 
lH Of assistance, by way of simplification, with respect to 
each of the sub-skills. ‘Thus, he was forced to learn these sub-skills as a whole. 


An indication of this fact is the low value of the area, A(T), between the curves 
for 7,(n) and 12n) up to n= T. run in Condition B were 


Prone to rehearse each sub-skill separately, until a Point, usually around block 
yt 80, when they found the effort fruitless and adopted an alternation strategy. 


» the value of A(T) is higher than 
i by a comparison 
~ Of the values of A(T) and A(T)/T for the subjects in Conditions 


A and B. 
AT) XT)IT 
Mean S.D. Mean S.D. 
“ Condition A (N= 10) 63-4 22:21 0:81 0.16 
Condition B (N= 10) 175-7 53:35 153 0-27 
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Latency Patterns To investigate the remaining TEL iesis (Ea og eT 2), 
the subject should learn to perform sub-skill 1 in the same tig behaviousllll 

it was necessary to obtain a more detailed picture of the রে tr ceeconad 
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latency patterns (or, neglecting the sequential ordering of components, t J 
classes, ay, by, C4, ds, eg, indicated below): 


24= quadruple of four ‘ c 
d=" triple + single’ or ‘ 
€4= double + double ° g 
b=" double + single + single’ or single + double + single ’ 
Or ‘single + single + double i 

44=" single + single + single + single ?. 

Using the same classi 
two classes (b, and as) 


For each Subject, each sub-skill, 7 
response patterns i i 


Oincident ° responses; 
single + triple? ; 


(a) for m=1. 
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Condition A Condition B 
Sub-skill 1 Sub-skill 2 Sub-skill 1 Sub-skill 2 


“Ll L 
“AE ee 


100% 


100% 


wes HEE BE: 0 OT 


100% 


100% 


100% 


100% 


EEE HEE. EE NEE ee 


C 


bd 


1 


; FicuRE 8. Histograms showing the development of latency patterns. One subject in 
Condition A and one subject in Condition B. Sub-skills 1 and 2 are represented 


) separately. 


| 1 

yf Ll However, if a subject acquires a common concept that is applicable to 

A b-skills 1 and 2, his correct response latency pattern for sub-skill 1 may be 

J xpected to approach his correct response latency pattern for sub-skill 2. Further, 

খ his tendency should be in very much greater evidence in Condition A, fl 
the subject is forced to alternate the sub-skills, than in Condition B, Whe 
constraint is absent. The hypothesis is supported by the oe. oT B. 
crude ~omparison between the four subjects in Condition A and 000A 

“A rank correlation (Spearman’s p) was obtained between the correc 
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i i i =7 and 7:= 8). Fo 
tages in sub-skills 1 and 2 for the terminal states (ni RE UY 
Upnditlon 2, the p values are high and for Condition B they are lower. ‘Th 


i elation indicates 
test applied to the difference between the coefficients of correlation indic 
that this difference is modestly significant (P < 0-05). 


Stability of the System In conclusion, it is necessary to make SBE ee 
comments about the stability of the ‘ subject, M’ system. ev nef 
in Condition A is stable in respect to arbitrary negative perturbations 0 a 
11 Or 1, the effect of the perturbation being compensated by a positive ET if 
of the unperturbed variable (that is, 7, by 72 and vice versa, not 7 by LE ofl 
Experiment 1). Since the system is stabilized at the lowest possible ot bell 
Po’ Stability in respect to arbitrary positive perturbations of the 1; Wo | 
fortuitous. a 

‘The system in Condition A is Prone to instability engendered by participant 
interaction (the illegal usage of L°) because M is elaborate enough to Er Le 
subject something to ‘ Play with ’ (not Surprisingly, he is usually aware tha i 
is ‘playing with? M). As in Experiment i participant interaction 
encouraged by increasing pv. 

In Condition B, Participant interaction is 
allowed to communicate with M via the L2 
statements’ or ‘statements about the L 


already, the subject’s unfettered control 
learning. 


“legalized ’, for the SUAICC 
channel (to make and receive Ee 
° interaction’). However, as no 
Over the alternation strategy prolong 


Experiment 2() 


|! 
bject was allowed contingent (rather rei 1 

fettered) control over the alternation Strategy in an attempt to ‘legat 
Prolonging the learning process. rted fl 
Stem was an ‘ adaptive meta-system ’ in which M exe 


the 
€ Sort of control over the L1 discourse that it exerted esl Ge 
HM is modelled Upon a tutor who responds more or less symP 
student’s comment. 


e 
ত S about the conduct of a tutorial, on je 
Occasions Accepting suggestions that the topic should be changed, on 8 sk 
Occasions reje implest M capable of performing this of 
forms the ad j System shown in Fig. 9. Logically, this is a hie S 
archically organized Control system, in which there is an L1! control loop 
well as an LO control loop 


5 10: 
fy ucts the goals that he uses in formi 
concepts. Hence, the ada 


ৰ 2 bd 
E ট System is a system that allows us to ext 
nalize this crucial feature FF, 


of concept learning. 
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In general, 6(n) may be any suitable function of the conjoint satisfaction of 
the Lo goals and, in general, it may be used in any way that gives the subject a 
FF degree of L! control that is Proportional to his ability to satisfy the L° goals. 


FIGURE 9. Adaptive meta-system. e, electronic dice-thrower biased by 0(n) for selecting 
i(n) = I(n) or i*(n). Other notation as in Figs. 5 and 6. 


Method 


As in Condition B of Experiment 2(i), the subject made a statement, say 
I(n), for each block of five trials. I(n) designates one sub-skill or the other 
Chosen for rehearsal and it is interpreted as a preference statement, based upon 
the L! evidence. In addition, M makes a ‘preference ’ statement, say i*(n), 
based upon the same evidence. Often, I(n) and i*(n) are not identical and, in 
this case, the system in Fig. 9 accepts I(n) with a probability of 6(n) and i*(n), 
with a probability 1—0(n), the accepted statement which determines the sub- 


ki chearsal being denoted i(n) as before. 
hs ie surile f(n) has the properties: (0)=0, 6(T)=1, 1> 0(n)>0, and 
: as the terminal criterion is approached. 


increases monotonically 
0 সপ O(n) was chosen as 


For the present purpose, the form of 
(n)—-t 
M(n)=piln) * pan) * mln) nn) [L-lriln) $l. 
for thi i They came from the same 
] jects was used for this experiment. y 
1 জল ৯ 4 ate in the previous experiments. 


i jects us | 
population as the subjec PAIN. 


i i i used in Condit! 
e identical with those TLE 


Procedure The instructions ar ft of theranbicatls rn 


2(i), except for an assertion that the accep 
: contingent upon the value of 60m). রি 
J |) 


) 
‘i 
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Results and Discussion of Experiment 2(it) 


Involvement in an adaptive meta-system has a Profound লম" 
effect upon the subject. ‘The learning situation becomes Hamel te চক : Roe 
is legal, in contrast to the incidental Play of Participant Rs LE chan 
rate of learning achieved in this experiment (estimated by 1 ) ডর TEE 
the rate achieved in Condition A of Experiment 2(i). This result ই oh 
prising if the ‘adaptive meta-system ’ really does act as it is meant to RE 
particular, if it eliminates the unwanted aspects of ‘ participant i TE 

For the 10 subjects concerned, the mean value of 7° is 60 and নল Ee 
9.5 (as compared with the values for Condition A and B of 76°2, 5. ন lib 
and 116-4, S.D.=20:81). When the meta-system T values are compare 0 
Condition B (using the Mann-Whitney U test) U=0 (P< 0-001); the SADE 1s, 
between the meta-system T values and those from Condition A Yields LE) ঢা nd 
so that the difference is just significant (P<0-05). Jonckheere’s Cos 2 
test, applied to Conditions A, B, and meta-system, gives S/ a=4-6 and so the 
hypothesis is rejected in favour of the predicted trend 


T;> Ti> Tinota-eystom (P <0:001). Hl ues 
It is also possible to assert that the form of learning, as reflected in net 
of AT)/T, conforms closely to the form of learning in Condition A. rion 
again the conclusion is not at all surprising because M dominates the alternati Ke 
strategy unless the subject is performing successfully. ‘The mean value bt 
A(T)|T for the 10 meta-system subjects is 0-86, S.D. = 0.28. Using the Man চি 
Whitney U test, there is no significant difference in respect of AT)I/T, bey 
the meta-system and Condition A, but thereis a significant difference (P< 0-00 
between the meta-system and Condition B, the inference being that learning 
in the meta-system is unlike the learning in Condition B, but similar to t 
learning in Condition A of Experiment 2(i). 


4. SUMMARY AND CONCLUSIONS 

It appears to be possible and 
adaptively stabilized condition 
Po, throughout most of the lear 
procedures that maximize th 
the subject (to place him in 
method has been applied to t 
ponents. Here, learning is 
heavily and lead him to ov 


useful to study certain types of learning 
Ss that maintain the value of p near to a cons: ng 
ning process. In these conditions, the stabil ad 
€ learning rate are also prone to maximally !9 
2 state of considerable uncertainty). The 1 
he acquisition of complex skills with several Ct ct 
enhanced by control strategies that tax the su o - 
ercome the effect of interference (between the ত 
ponents of the skill) by acquiring a concept of the skill as a whole. sa 
regularities in the learning Process have been exhibited by using this experime ijl 
method. In particular, it is possible to predict the way in which a subject 
accept assistance through Problem simplification. kill 
Experiment 1 provided data about the acquisition of a simple coding i M- 
when human learning is compensated by a null-point control mechanism fl 


' 


4 


| 
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In Experiment 2(i) a similar technique was used to study the acquisition of a 
complex skill in which a pair of sub-skills are alternately rehearsed. The 
comparison of Conditions A and B yielded information about the interaction 
between the sub-skills and the development of a human conceptual organization. 
The learning situations involved in these experiments have also been used to 
illustrate the stability of a coupled ‘subject, M° system and to demonstrate 
an interesting (and apparently ubiquitous) mode of man-machine interaction, 
‘participant interaction’, which is believed to reflect essential facets of goal 
construction and concept learning. Experiment 2(ii) introduced one special 
case of a construction, the ‘ adaptive meta-system ’ which allows the experimenter 
to investigate ‘ participant interaction ’ as a ‘legal activity’ within the experi- 
mental framework (rather than regarding it as a mere misuse of the stimulus 


response modality or object language £0). 
In each experiment, M has been viewed as a measuring and stabilizing 


device which provides otherwise inaccessible data. M may also be regarded 
as a teaching device; its functions in one capacity or the other are logically 
inseparable. Further, M may be generalized as the tool of an experimental 
method (Pask, 1963 a, b); the system ‘ subject, M ’ does not learn and is amenable 
to measurement, even though the system ‘subject’ does learn. Using an 
adaptive meta-system, it is also possible to externalize and measure the goal 


modifying activity of concept learning. 
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NEW METHODS IN MAXIMUM LIKELIHOOD FACTOR ANALYSIS 


By K. G. Joreskoc 
Educational Testing Service, Princeton 


and D. N. LAWLEY 
Department of Statistics, University of Edinburgh 


Until recently the main difficulty in the use of maximum -likelihood estimation 
in factor analysis has been the lack of satisfactory methods of obtaining numerical 


solutions. ‘This defect has now been remedied, and this paper describes new 
rapid methods of finding maximum -likelihood estimates. 


1. INTRODUcTION 
In the field of Psychology, 


factor analysis is most often employed to study 
the measurements that aris 


: 1 € from the use of a battery of tests. It will be 
convenient to discuss factor analysis with particular reference to this type of 


data, though most of the remarks in this Paper are relevant to a much wider 
Context. First of all, : i 


factor analysis. In confirms 


Position to formulate a hypothesis that 
depend. FF: i 


step is to decide how many factors are nee 

and to estimate the loadings on the factors, which are initiall 
What arbitrary manner. A second Step consists of a rotati 
formation of these factors into others which can be given 
interpretation. 

In practice, the above distinction is not always clear-cut. Many investiga- 
tions are to some extent both exploratory and confirmatory, since they involve 
Some variates of known and other variates of unknown factorial composition. 
‘The former should be chosen with great care in order that as much information 
25 possible about the latter may be extracted. It is highly desirable that a 
hypothesis that has been Suggested by mainly exploratory procedures should 
Subsequently be confirmed, or disproved, by obtaining new data and subjecting 
these to more rigorous statistical techniques. 
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The aim of this expository paper is to give a brief review of peti + 
developed methods employing maximum -likelihood 0 eR oe 
each type of analysis. Numerical examples illustrating ক le 
provided in the last section of the paper. Most of the algebraica ie a 
underlying theory have been omitted since they can be found in 0 
referred to in the text. 


2. PRELIMINARY CONSIDERATIONS 
The basic model in factor analysis is 


x= Af+e, 0 
where x is a column vector of p variates, fis a vector of Rk common factors, Ee fl 
vector of p residuals, which represent the combined effect of specific factors an 
random error, and A= [ir] is a p x k matrix of factor loadings. ie 

The residuals e are assumed to be independent of each other and of a 
common factors f. It is also assumed that the elements of f, e and x are pi 
normally distributed with zero means. The dispersion or covariance RAC 
of f, e and x are denoted respectively by @&, Y and 2. ‘I'he matrix Y is diagona 
with elements fii (1=1,...,p), which are termed either residual or unique 
variances. It is further assumed, without loss of generality, that the common 
factors have unit variances, so that the diagonal elements of ® are unities. 
in addition, for k> 1, the common factors are a 
the non-diagonal elements of ® are zeros and thus & becomes the unit matrix 
of order k. In view of eqn. (1) and of the assumptions that have been made, 


2 is given in terms of the other matrices by the equation 
I=ASA +. %) 


This relationship can be tested statistically, unlike eqn. (1), which cannot be 
verified directly. 


2 


en 
orthogonal or uncorrelated, the 


iy | 
Suppose that a random sample of n+ 1 sets of observations of x is Sbtan 

and that S is the matrix whose elements are the usual unbiased sample estimate 
of the elements of 2. In view of the assumptions of normality, the elements ঢ 


S follow a Wishart distribution with n degrees of freedom. ‘This means that 
the log-likelihood function L 


t : corresponding to the information provided by 
1s, neglecting a function of the Observations, given by 


L=_in {loge |X| + tr(S2-1)}. 
| In order to obtain efficient estimates (for large 1) of all unknown parameter 
L is maximized with respect to these Parameters. In practice, itis slightly mor" 
convenient to minimize the function 
3 
FA, &, Y)=loge|| + (S21) logels|-p. ‘ j 
Minimizing Fis clearly equivalent to maximizing L, and the minimum value © 


F' multiplied by a constant is later used as a ‘ goodness of fit ’ x* criterion. 


bh? bh AAR 
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When k> 1, and there is more than one common factor, it is necessary to 
remove an element of indeterminacy in the basic model before the procedure for 
minimizing F can be applied. This indeterminacy arises from the fact that a 
Non-singular linear transformation of the common factors changes A, and in 
Seneral also @, but leaves 2, and therefore also the function F, unaltered. 
Hence, in order to Obtain a unique set of parameters and a corresponding unique 
set of estimates, some additional restrictions must be imposed. ‘These have the 
effect of selecting a particular set of factors and thus of defining the parameters 
uniquely. 


3. ExPLORATORY FAcTOR ANALYSIS 
Suppose that an entirely exploratory factor analysis is to be performed. 
Since the correct value for &, the number of common factors, is unknown, it . 
must be determined by a process of trial and error. ‘This will be discussed 
later. For the present, assume that the value of Ris specified and consider the 
estimation procedure. 
. The factors f will be taken to be orthogonal. ‘Thus & =], and F= F(A, ্) 
পল» the elements of Asnd Y. To define A the condition that 
SERRE: CHIE AER 1S Imposed. Jt is Supposed also that the diagonal 
: Matrix are distinct and that they are arranged in order of 
magnitude. This is usually justifiable in practi Subje 
condition, the function fF is minimized with k Et be es ln LS 
ND espe to ely Bote rec WIth respect to the elements of A and 
ti. By hypothesis, X2 is Now given by the equation 


LI=ANM LY, (4) 


Parameters in A and Wis Dh +p; but the condition imposed 
71) constraints upon them, so that the number of ‘free’ 


‘The number of 
Introduces k(k 
Parameters is 


MR+1) IRR D. 


গ্ৰ Ls hypothesis Tepresented by eqn. (4) to be non-trivial this number must 
ess than 1 p(p + 1), the number of distinct elements in 2. This is equivalent 

to the Inequality 
(p-hyp<p+k. (5) 


Problem of minimizing F (or of maximizing the likelihood) was first 
to th ered by Lawley (1940). Since then Various other writers have contributed 

te subject, A More recent statement of the problem has been given by 
“awley & Maxwell (1963). A number of methods have been proposed for 
Maximizing the likelihood. These have all been based upon a direct numerical 
Solution of the equations 


oOF|/0i, =0, oF |/ofii=0 


y ved alge- 
xP ANd F=l, ak Since these equations cannot be solved alg 


for 7= i f 
FH b ন most © 
Talcally, some Iterative procedure has to be used. Unfortunately, 
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those previously suggested have not been entirely PRC fe ope "ও 
to the final solution has often been extremely slow, and in ke সর a 
procedure has failed to converge. It has been customary to stop " টী Shs 
the maximum correction to any parameter is less than a va 60GB) i 
stopping rule is, however, completely unreliable. As J KESTOE EET 
shown, it has on occasions produced supposed solutions that are v Tr : ন his 
Recently a completely new method has been developed by the aut চক to 
has proved to be completely successful. The full technical details 0 a ন 5 
are given in Joreskog (1966 b), and a computer program written Dy ee 
FORTRAN IV is also available (1967 a). The method concentrates LE ! 
on the function F itself rather than on solving the above equations. ‘I Eh 
iterative, it converges extremely rapidly, and the values of the parameters whi 
minimize F can be determined as accurately as desired. 


Suppose that for given Y the function F has a minimum when A= Ay, 
and define the function f(¥) by 


(¥)= min FAY) =F(Ay, Y). 
A 
Then 


min f(¥)= min AY. 
kd AF 
‘Thus the problem of minimizing the function F with respect to A and ¥ has 
been transformed into that of 


minimizing the function Ff with respect to the p 
Variables Yi. 
For given Y the numeri 


Cal determination of Ay, using a computer, presents 
no difficulties. It consists mainly in finding the k largest latent roots and the 


Corresponding latent vectors of the matrix Y1SY-+, We assume, as is usually 


the case, that these roots are distinct and greater than unity. ‘The columns of 
Ay are very simply related to the latent vectors. 

he minimization of fis accomplished by using a method of Fletcher & 
Powell (1963). No attempt wi i i 
of the method is that in ea 
function f is used to estima 


-degree approximation to the 
of matrices YO), YW)... such that 


ts Ee 
This results in a sequenc 


HA: A) <(¥)). 

‘The sequence converges rapidl 
new Ys) there is an associate 
A matrices, which Converges t 
The procedure begins w 
been shown by Joreskog that 


Y to a final ma 
d new Ads). 

0 a final matri 
ith an initial 
a reasonable 


trix of estimates YW. With each 
Thus there is also a sequence © 
X of estimates A. র্‌ 
Approximation W for Y. It has 
Approximation is obtained by taking 
iid =(1- k/p)(l st), 


where sit is the ith diagonal element of S71, 


A). 


০১, 


ahah 
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In each iteration the method of Fletcher & Powell requires the calculation 
f the fanatics value and also the partial derivatives off obi. The latter 
কং asily found, since they are in fact the diagonal elements of the matrix 
are eas ঢ 


F-U(ALAy +E SJE. 


The value of fis computed as a function of the late hy ড ' 
addition, the calculation of a positive definite Symmetric matrix E of order é 
is required. As the iterative procedure converges, the Sequence of E ates 
converges to the inverse of the matrix of second-order partial derivatives 
Nf] opi, ;, evaluated at the minimum. 

‘The number of iterations require 
of a good initial estimate of E. 
Lawley (1967). By standar 


nit Toots of VASP, In 


dis considerably reduced by the provision 
A method of obtaining this has been given by 
d estimation theory, the final matrix E multiplied 
by 2/n provides estimates of the sampling Variances and covariances of the 
estimates hu. In the same paper Lawley has shown how the Variances and 
covariances of the elements of A may also be found. 

Various stopping rules for the above procedure could be adopted. In 
practice, it seems best to Stop when the value of each of the first-order partial 
derivatives is less than a Small prescribed value. 

When the maximum-likelihood estimates of A 
and f has been minimized, It is Possible to t 


and YF have been found, 
eqn. (4). The minimum va 


est the hypothesis represented by 
lue of fis multiplied by the factor 


n—-(2p+5)/6-— 2R13, 
and the result is treated as a y2 variate for which the number of degrees of 
freedom is 

£{(P-Ap (p+). 
This number is Positive provided that inec 
is accepted or rejected according to whet 
prescribed significance level 
is reasonably large. 


Juality (5) is Satisfied. 
her the value of ¥ 15 b 
+ This y2 test is Valid provide 
A safe rule is that n> 50. 

In the above discussion one important point h 
Certain data give rise to what has often been termed 
a Case the function f has a minimu 
the residual Variances are negative. 
is considered only within tl 
Positive value e. 
0:005. If the sn 


‘The hypothesis 
elow or above a 
d that the value of n 


as not been mentioned. 
a Heywood case. In such 
m only at some point where one or more of 
‘To overcome this difficulty, the function f 
1 region R., where cach yYii> ec, for Some small 
In practice, for Standardize have taken € to be 
allest value of f within R boundary, so that at 
least one of the ii is equal to 6, the Solution is called improper, Since in this case 
f has not attained 2 true minimum within RR 
Suppose that 
equal to ce. 


d variates, we 
is attained on the 


an improper Solution is Obtained in Which mm of the fii are 
The hypothesis is then reframe 


d, and it is assumed that these 
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residual variances are really zero. The corresponding m variates are eliminated 
and the partial dispersion matrix calculated for the remaining p—m variates. 
The first m factors are defined as the m principal components of the m eliminated 
variates. (Ifm=1, the first factor is simply the eliminated variate after standard- 
ization.) The loadings of the remaining p—m variates on the remaining k—m 
common factors are obtained by an analysis of the partial dispersion matrix of 
orderp—m. This yields a proper solution and a(p— m) x (k—m) loading matrix. 
The usual x2 test is now performed with p, k and replaced respectively by 
p-m, k—-m and n—m. ‘The proper solution is finally combined with the 
information provided by the m eliminated variates to vield a pxk matrix A 
which gives maximum-likelihood estimates of the loadings of all p variates on 
all k factors. The final matrix ¥ has zero diagonal elements corresponding to the 
m eliminated variates. The above process is equivalent to an application of the 
maximum-likelihood method in which m residual variances are by hypothesis zero. 
| be frequency with which improper solutions appear to occur in practice 
ten " Ee Re aerigs conducted by Mattsson, Olsson & Rosén (1966), 
are summarized by Joreskog (1966 b), eleven sets 
of data were analysed by the method described here. In all but two sets 
improper solutions were obtained. In some cases these i ডট luti s had 
not been suspected when the data were originally analyse J Ete oa ly t 
the iterative procedure then used was SDeLNGD ysed, owing to the fact tha 
Since the number of common factors requi ed initi E 
sequential procedure for determining k is u ৰ Siar bane pie 
value k;, which is often taken to be 1, or ev hd STAG a ot Smal 
leads to a rejection of the hypothesis (eT 9. k=k; is tried. If this value 
36h Feiectitn, Mhe.value of Eris Nr জঃ ye is next tried, and so on. With 
An acceptance terminates the process A | bof and the 
the largest integer for which inequalit 5 1 upper limit for the value of kis hy, 
the value of ¥? is still significant, this { চা জলদ oe ee 
form represented by eqn. (4) iS acceptable, সম np non-trivial hypothesis of the 
tn certain theoretical. objection, brio : This sequential procedure is open 
serious and the method seems adequate Fg these are unlikely to be very 
It should be emphasized shal Or an exploratory type of analysis. 
been proved that this is the true ee at a final value for k, it has not 
final value of x2 is not much above ex er of common factors present. If the 
hese oD Any MDS aonmon pA, this implies simply that it would 
from experimental error. If, on the nt i these would be indistinguishable 
value of Rk so large that all factors aflers রঃ he sequence of tests JCS 
interpretation, the argument can Be adva rotation, cannot be given meaningful 
should be used even though this does not Re that a smaller number of factors 
bered that, except in artificial sampling e tthe data. It should also be remem- 
les a8eiDnp donsoPlineatityraid ore EEG the basic model (1), with 
The topic of non-linearity bas been J S merely an approximation to reality. 
See also GHEE Papers Tferred fo there ussed recently by McDonald (1967). 


process continues. 


" 


eh 
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4. CONFIRMATORY FACTOR ANALYSIS 


In this section it is assumed that the experimenter has been able to set ks 
hypothesis that defines the parameters uniquely. The hypothesis Rt 
the values of certain elements in A and in@®. Jt may, in addition, speci rt ‘ 
values of some or all of the residual variances hii, though this would be unusua 
in practice. As a rule, the values specified for the elements of A or the non- 
diagonal elements of © are zeros, but other values could be used. Let the 
numbers of specified or fixed parameters in A, ® and (including diagonal 
elements of @) be denoted respectively by n,, 1g and ny.. ‘Then a necessary, 
though not sufficient, condition for uniqueness is that 


2 
n+ ng >Re. 


In- general, it is difficult to give sufficient conditions for uni 
Positions of the fixed paran 


eters are important as well as the numbers. 

A common type of hypothesis is that certain of the loadings, at least k lin 
each column of A, are zeros and that the diagonal elements of © are unities; 
the factors are then correlated or oblique. Another common type of hypothesis 
is that certain loadings, at least 2K(R— 1) in number, are zero and that & is the 
unit matrix of order &, in which case the factors are uncorrelated or orthogonal. 
It is possible, however, to have hybrid cases in which One group of factors may 
be correlated while the remaining factors are assumed to be uncorrelated with 
this group or with each other. The generality of the method gives it great 


flexibility, since it will deal with all such kinds of hypotheses. Technical details 
and several examples illustrati 


ng the usefulness of the method are given by 
Joreskog (1967 0). 


queness, since the 


he total number of Parameters in A, @ and WY is 


PR + IR(k + 1)+p= (2p + R)(R + 1. 


The number 9 of fixed parameters is given by 


=n, + Ngt+ ny, 
and the number of free parameters is 


(2p + R)(k + 1)-s. 
For the hypothesis to be non-trivial this 


number must be less than 
This is equivalent to the inequality 


MD +1). 

P+ q> UP+R(p + RSD. 

To apply the method of maximum likelihood, 

the likelihood, or to minimize the functi espect to all free Parameters. 
As before, Fis given by eqn. (3), and X is piven : 


eqn. (2). Previous methods for maximizing the likelihood i 
kind are referred to by Lawley & Maxwell (1963). 


(6) 
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partial derivatives with respect to the free parameters are equated i ee ig 
after some algebraical simplification, an iterative procedure for so!v 
equations is employed. Recent work has shown, however, that such Procec UIC 
do not always converge. Even when convergence does occur it is usually is 
slow. A better method, for which ultimate convergence is assured, was given 
by Joreskog (1966 a). Experience with this method has made it clear that it 
still sometimes difficult to obtain a very accurate solution unless many iterations 
are performed. Efficient minimization of the function F seems impossible 
without the use of second-order derivatives. 

The present procedure again uses the method of Fletcher & Powell. 
Unfortunately, a two-stage minimization procedure such as that described in 
the previous section is here not possible, except in special cases. ‘The function F 
has therefore to be minimized simultaneously with respect to all free parameters. 
The E matrix evaluated in each iteration converges finally to the inverse of the 
matrix of second-order derivatives with respect to the free parameters. A 
method of providing a good initial approximation for E, and thus of reducing 
the number of iterations required, has been given by Lawley (1967). ‘This 
involves the calculation and inversion of a symmetric matrix G, whose elements 
are approximations to the second-order derivatives. In subsequent iterations 


no further matrix inversion is required, since only simple modifications to E 
are necessary. 


‘The order of the matrices G and E is the number of free parameters. I 
this is not too large, the above calculations are easily performed. But if, for 
example, there were 40 variates and 10 common factors, the number of free 
parameters might well be almost 400. The inversion and storage of matrices 
whose order is as large as this present considerable difficulties. With the 


development of computers having greater storage capacity than those of today, 
these difficulties may well disappear. It has been found that with a G matrix 
of large order a considerable number of non-diagonal elements may reasonably 


be neglected. ‘This means that a fairly good initial estimate of E can be obtained 
by inverting only a number of relatively small sub-matrices of G. ্ 

The elements of the final E matrix multiplied by 2/n provide estimates of 
the sampling variances and covariances of the estimates of the free parameters. 

The minimization procedure starts with initial estimates of A, ® and Y. 
The better these are the fewer iterations will be required. For the most common 
types of hypotheses good initial estimates are given by the factor transformation 
methods proposed by Lawley & Maxwell (1964). From the initial point, it is 
usually best to perform a few steepest descent iterations before employing the 
method of Fletcher & Powell. Steepest descent iterations have been found to 
be very effective at the beginning when one is not very close to the minimun™ 
They enable one to obtain better approximations for G and for the initial E 
matrix. - 

When maximum-likelihood estimates of the free parameters have been 
found and F has been minimized, it is possible to test the hypothesis represented 
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by eqn. (2) with its specified values for the fixed Parameters. 
value of Fis multiplied by the factor 


n—(2p+5)/6, 


and the result is treated as a NX 
freedom is 


The minimum 


Variate for which the number of degrees of 


PIP +Rp+R+I) +. 
This number is positive Provided that inequality (6) is satisfied. 
is accepted or rejected according to whether the value of y2 
chosen significance level. 
‘The method of this section has been 
Joreskog & Gruvaeus (1967). The Progr: 
computer. 


The hypothesis 
is below or above the 


Programmed in FORTRAN Iv by 
am has been tested on an IBM 7044 


5. ANALYSIS OF SoME EMpPiRicar DarTaA 


Scussed, some da 
Swineford (1939) are used. The data consist of 26 Psycholog 


tered to 7th and Sth grade children in two different schools. 
White sample is used here. Thi 


of sizes 73 and 72 respectiv 
confirmation sample. 
above, where miscellan 


‘To illustrate ta of Holzinger & 
ical tests adminis 
Only the Grant 


in the reference 


TABLE 1. ‘TEsTs oF HypPoTHsEs CoNcERNING NumsER oF FAcToRrs ExPLORATION SAMPLE 
Degrees of 
hk x freedom Ey 
0 253.00 36 < 0-001 
1 101:30 27 < 0-001 
2 46:39 19 <0:001 
3 5:14. 12 0-95 


The correlation matrix of the exploration sample w 
the UMLFA Program of Joreskog (1966 b). The hypotheses that the number 
of factors was 0,1, 2, and 3 Were Successively tested. 

of degrees of freedom and Corresponding Probabiliti 


Obably wise t Cc 
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interpretation of the data this solution was rotated orthogonally using the 
method of Kaiser (1958). The varimax solution is given in Table 3. * 
the sample size is rather small, sampling variability is very large. Hence i 
factor loadings larger than 0-30 in absolute magnitude are interpreted. It then 
seems that the first factor, determined by tests 1, 2, 3 and 9, is a visual factor, 
the second factor, determined by tests 4, 5 and 6, is a verbal factor, and that the 
third factor, determined by tests 7, § and 9, is a speed factor. 


TABLE 2. UNROTATED MAXIMUM-LIKELIHOOD SOLUTION FOR EXPLORATION SAMPLE 


i Ain Aig Aig Wii 
1 0:59 0-14 0-37 0-49 
2 0-37 —0-19 0-45 0-62 
3 0:42 032 0-53 0:44 
4 0°71 = 0-37 — 0-27 0:29 
5 0°71 — 0:26 0-23 0:37 
6 0-74 0-33 — 0-17 0:33 
7 0-50 0:58 = 0:30 0:32 
8 0:65 0-54 0°13 0:27 
9 0-64 0:34 0:27 0:40 


TABLE 3. VARIMAX ROTATED SOLUTION FOR EXPLORATION SAMPLE 


Factors 

Variate Visual Verbal Speed 
1 0-61 0-30 0:23 
2 0-60 0-12 0:06 
3 0-72 0°18 — 0:02 
4 0°17 0:82 0-11 
5 0°18 0-75 0°21 
6 0-26 0:76 0:16 
7 0-22 0:22 0-76 
8 0-21 0°12 0:82 
9 0:40 0-14 0.65 


‘To examine further the reasonabl 


1 J eness of this interpretation, the following 
target matrix for A is set Up: 


# ESOS EE 
SOOERESOoSoS 
# #8 ooo osooo 
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The matrix of Table 2 was transformed into an obli 


ডা Agreement as possible with this target. This was accomplished by use of the 
alan of Lawley & Maxwell (1964). The method transforms the factors in 
such a way that, for any column of the target matrix, the ratio of the sum of 
squares of loadings corresponding to zeros in the target to the total sum of 
les is minimized. ‘The solution is given in Table 4; itis evidently a refine- 
ment of that given in Table 3. With a few exceptions the small loadings have 
become smaller and the large loadings have become larger. 
"Ye TABLE 4. ‘TRANSFORMED SOLUTION FOR EXPLORATION SAMPLE 
y Factors 
Variate Visual Verbal Speed . 
0-60 0-14 0-14 Factor correlations 
oD 0:63 = 0:02 = 0:00 Visual Verbal Speed 
3 0-74 0-03 =0'11 Visual 1:00 
4 = 0-02 0:88 = 0-06 Verbal 0:45 1:00 
5 0-01 0-77 0-05 Speed 0:43 0:36 1:00 
6 0-10 0-78 0-00 
# = 0-24 0-14 0:78 
8 0:26 = 0:09 0:82 
্ধ 9 044 _০0.08 0:63 
বু TABLE 5. RrsTRIcTED MaxiMum-LixsLtno0p SOLUTION FOR CONFIRMATION SAMPLE 
Factors Residual 
Variate Visual Verbal Speed Variance 
1 0:68 ox O* 0-54 | 
2 ‘3 * al ! actor Corre, ations 
0:34 0 0 0:88 F l 
| 2 0-66 0* O* 0-57 Visual Verbal Speed 
bd 4 0* 0.91 O* 0-18 Visual 1:00* 
5 O* 0.87 0+ 0-25 Verbal 0-55 1-00% 
5 fp 0 oe 0* 0-32 Speed 0-47 0:09 1.00% 
0-65 0-58 
8 (0) 0* 0.93 0.15 
9 0:67 0* 0.19 0-39 
Asterisks denote Parameter values specified by hypothesis, 
ত The results obtained Suggest the hypothesis that the ni 


Program of Joreskog & Gruvaeus (1967). The maximum-likelihood solution 
under the hypothesis is given in Table 5. 


Since the value of X* is 29.96 with 23 degrees 
; probability of 0-15. 


> 
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It should be noted that the solutions of Tables 2-4 are three EN 
unrestricted solutions in the same factor space. They fit the observed baa oa 
tions equally well. The solution of ‘Table 5, on the other hand, is a a l 
one. The number of fixed parameters is 20, which is 1 more En ] 7 
necessary for uniqueness. ‘The restrictions affect the estimation of the SE 
variances. The differences between the residual variances in ‘Tables 2 and £ 
are therefore not entirely due to sampling errors. 


‘The above example has been given mainly to show how the দিলাম 
described may be put to practical use in cases where the hypothesis isnot specihe k 
Prior to the analysis of the data. If the hypothesis were set up in advance, one 
could proceed directly to the confirmatory stage. 


ACKNOWLEDGEMENT 


Part of this work was Supported by a grant (NSF_GB 1985) from the 
National Science Foundation to Educational ‘Testing Service. 


REFERENCES 

FLETCHER, R. & PowrLL, M. J. D. (1963). 
minimization. Computer 3. 2, 163-168. 

HoLzINGER, K. J. & SwINEFORD, F. (1939). 
of a Bi-factor Solution. University 
Monographs, No. 48. yu 

JorEskoc, K. G. (1966 a). ‘Testing a simple structure hypothesis in factor analysis. 
Psychometrika 31, 165-178. 

JoREskoc, K. G. (1966 6). UMLFA—A com 
likelihood factor analysis. 
Educational ‘Testing Service. , 

JorEskoc, K. G. (1967 a). Some contributions to maximum likelihood factor analysis. 
Psychometrika 32, 443-482. 

JoREskoc, K. G. (1967 b). A general approach to confirmatory maximum likelihood 
factor analysis. Research Bulletin. Princeton, N.J.: Educational "Testing Service. 

JoREskoc, K. G. & GRUvVaAEUS, G. (1967). 


ৰ Lu 
A rapidly convergent descent method fo 


A Study in Factor Analysis: The 0 
of Chicago: Supplementary Educationa 


a a ) 
puter program for unrestricted maximum 
Research Memorandum 66-20. Princeton, N.J.: 


Ff ৮G RMLFA—A computer program for restricted 
maximum likelihood factor analysis. Research Memorandum 67-21. Princeton, N.J.: 
Educational ‘Testing Service. # 

Kaiser, H. F. (1958). The varimax criterion for analytic rotation in factor analysis. 
Psychometrika 23, 187-200. 


LAWLEY, D. N. (1940). Th 


€ estimation of factor loadin 
likelihood. Proc. Roy 


£s by the method of maximum 
. SOc. Edinb. (4) 60, 64-82. 


LAWLEY, D. N. (1967). Some Dew results in maxi likelik act. lysis. Proc. 
Roy. Soe. Edinb 4) 67, 256-264 mum likelihood factor analy 

LAWLEY, D. N. & MaxwELL, A. E. (1963). Factor Analysis as a Statistical Method. 
London: Butterworths. 


LAWLEY, D. N. & MaxweLL, A. E. (1964). 
statist. Psychol. 17, 97-103. 

McDonNaLp, R. P. (1967). Factor interaction in nonlinear factor analysis. Br. J. math. 
statist. Psychol. 20, 205-215. 


Factor transformation methods. Br. 3° 


‘The maximum likelihood method ন 
the problem of improper solution 


tatistics, University of Uppsala, Sweden.) 


Cd 


(w) and its mean a 


Vol. 21 The British Journal of Mathematical and May 
Part 1 Statistical Psychology 1968 
97-103 


MOMENTS OF THE STATISTICS KAPPA AND WEIGHTED KAPPA 


By B. 5S. EveRIrT 


Institute of Psychiatry, University of London 


of diagnostic Categories. 
Subjects with regard to a 


the Observers. ‘Traditio 


It has also been noted (S 
of the relative Seriou 


ctween the two Observers, 


: eighted ka a 
Nd variance are also Considered. lb i 
G 
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2. Kk AND kw 
The data appear in the following form: 


Diagnostician A 
Di Di ws Di 


D; my Mig ee Mjh ny 
D may m 5 ng 
oe 21 Mae 2 
Diagnostician B bl 
D mp Gun: ‘ua: FUE np 
h 1 


M, Mj ... My N 


Di (i=1.,..., h) are a set of diagnostic categories, 
mig (J =1...., A) is the observation in the ith row and jth column, 
1; are the row totals, 


HM; are the column totals, and the total number of observations is N. 


The formulae for «and kw are as follows: 


=(Po—PY(1- Po), (1) 

h Ah h h yy 

=1-( 3 > WijPoij PS WuPey), ( ) 
i=1 §=1 i=1j=1 


where Po= observed proportion of agreement, 


P= chance expected proportion of agreement, 
Poy =observed proportion in cell (i, j) and 
Pej =chance proportion in cell (i,j). 


In terms of the mij, n; and Mi; these are: 


P= Ly lS 
0= NL, mis, P= Ne 2 him, 
Pye L 
014 = N° Poij= Nad. 


. The Wij are a set of weights indicating the level or seriousness of the 
disagreement. ‘These are arbitrary and chosen by the experimenter. For 
example, In an experiment to measure the agreement between two psychiatrists 
in allocating patients to different diagnostic categories, a disagreement such a8 
neurotic disorder versus Personality disorder is not a serious as one such a8 
psychotic disorder versus neurotic disorder, and so the first type of disagreement 


would be allocated a lower weight than the second. (For the diagonal cells the 
weights are, of course, zero.) 


om AEE 


— 
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k May be interpreted as the Proportion of agreement Over and above that 
expected by chance. It can be shown to have almost the same value as the 
product-moment Correlation coefficient for ‘the dichotomous Case. The 
advantage of ,¢ appears to be that it is more Intuitively reasonable, and also that it 


leads to kw, thus allowing for differences in the seriousness of disagreement. 
s 


3. MEAN AND VARIANCE OF « AND kw 
‘The probability density function of the mij is given by: 


h Ah A 
Pmy)= Tans! TEM! i I mylIN!. (3) 

i=1 j=l i=1 

3=1 
From eqn. (3), the means, variances and covariances of the m 
finding the factorial moments of the distribution. How 
may also be found in the following Way without as 
factorial moments. 


The marginal distribution of a Single mij can be Shown to be a hyper- 
geometric distribution given by 


(Nn) MIN _ Mg)! 
ly) = Nims — my =n) NN 4) 


=_ 7 — M; + mij)! | 
‘Therefore the mean and varia 


iY Can be derived by 
ever, the covariances 
suming any knowledge of 


nce of mij are 


nM; q n(N-— n)M(N — M,) 
N hie NAN-T) 
Tespectively (Mood & Graybill, 1963, p. 110). Also the Sum of any number of 
the m,jyis a hypergeometric variable, and using this fact the Covariance of any two 
of the ij can be derived. ‘The covariance of mj, and #is can be found as 
follows. 
Consider T= 


mij +m, then 


var(T) = Var(m,,) + var(m, 2) +2 cov(m,,, mo). 


(5) 
Now Tis a hypergeometric Variable with Variance given by 

va(T)= u(N-ni)(M, + Mi)(N — M,-M;) | (6) 

NAN- 1) ’ 
and the variances of Mj) and m,, are: 
n(N-n)M (N-M ) 
Var(m,;) = মম Ne TEE (7) 

and 


var(m,,)= "(NV NAN 2. (8) 
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Therefore using eqns. (6), (7) and (8) it is possible to solve eqn. (5) to give 
—-nMMAN-—n,) 
NAN-—1) l 


Cov(m;j1,mj2) = 


Similarly, for any two elements in the same row, 
—-nuMpMyp; s(N —m) 
N*AN-—-1) 


লট? iy ; টু an be 
In a similar way the covariance for any two elements in the same column can 
shown to be 


C 
Cov(mtp, mt pi s)= (s#0). (9) 


—nint;sM5(N — My) 
N*AN-1) 

For diagonally opposed elements, the same type of procedure is used. For 

example, to find the covariance of mj, and mss, or of mj and ms; (these two 

covariances are equal, as may be seen by considering the probability density 

function of the mij), the following procedure can be followed. 


Consider T=mj, +m,2-+ma21+m22. Then 
var(T)=var(mj;) + var(m; 2) + var(msg1) + var(ms22) 


+2 cov(m;1,m;2) +2 cov(m;,, m2) 4+ 2 cov(m; 1, ms25) 


(520). (10) 


CoV( mip, mt sp) = 


+2 cov(m, 2,31) +2 Cov(m, 2, m22) + 2 COV(m sy), M22). (11) 
Again T is a hypergeometric variable with variance given by 
var(T)= (0, +n2)(N —n, —n2)(M; + M3)(N — M; — Ms) | (12) 
NAN-1) 


‘Therefore in eqn. (11) the only unknowns are Cov(m;;, m2) and cov(m; 2, 21) and 
as they are equal they may be found from eqn. (11). ‘They are given by 


nnaM,Ms 


CoV(m; 1, m2) = COV(m, s, moi) = NAN-—1) - 


Similarly, for any diagonally opposed elements, 


Cov(mst, Ms, w tu) = ER (w#0,u20). (13) 


Using the results of eqns. (9), (10) and (13), the mean and variance of the 
statistics « and kw can be derived. 


(a) Mean values of « and kw 


1 
E(x) = I=-F; {E(P,)-P.} (since Pe is fixed by the marginal totals), 


where E denotes mathematical expectation. 
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Now, 
1A 1h nM; 1 A 
= =A RTD nUuMi=P.. 
E(Po)= NZ E(mj;) 2 N NL iM; c 


Therefore the expected value of tis zero. 
The expectation of Kw is given by 


A Ah —1 A A 
Bu) =1-( > i) 5( > WP); 
i i=1 j=1 


i=1j=1 
and 


h h h A 
#5 2 WiyPoy) = > 3; WyE(Poy) 


i=1 §=1 


t=1 j=1 
Ah M h 
=Xস > Wy = WigPoij. 
i=1j=1 Ne i=15= 
Therefore the expected value of kw iS also zero. 
(b) Variances of « and Kw 
‘ 1 
var(«) = TH var(P); (14) 
{ ra 
var(Po) = ল্য { স Var(my) +2 Sy bh comm) } E (15) 
NUS i=1 5-1 
Hence, using the values derived for the Variances and covariances of the mij, the 
Variance of pe may be found. 
hh =2 AA 
প্)-( সি ৯ WiyPay) var (22 WP), (16) 
i=1j=1 i=1j=1 
Ah A ff 4 
Var (২ ড্র WijPoig = 1 চা Wigtvar(m,;) +2SW;; Wis Cov(miy, )} # (17) 
i=1 j=1 i=1j=1 
Where S denotes the i 


Hence it is Possible to 
eqns. (14) and (16) are both 


The variances given in 
S0 easily calculable a 


» especially that of kw, and 
are given below. 


4. APPROXIMATE VARIANCES OF K AND ky 
If it is assumed that P, and Pyij are binomial Proportions, then it follows 
that 


var(¢) = s Poll Ps 18 
UP RN (18) 


h h -2 0h h 
(= (Y WuPey) (22 WrPa(t Poy) (v.09) 
i=1 j=1 i=1 j=1 


, 
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5. NUMERICAL EXAMPLE 


Diagnostician B 


D, D, D, 
D, 106600) 100) 46) 120 
Diagnostician A D, 22 (9) 28 (0) 10 (3) 0 
D, 26) 1206) 60) 20 
130 50 20 200 


‘The figures in brackets in the top right-hand corner of each cell are the EE 
for each cell. For example, the weight attached to a disagreement of the kinc 
Di; vs. Dis 9. For this table and set of weights, «= 0.4286, w= 5071. 


(a) If xis used, 
A i aa 1 (0-700) . (0-300) 
Pproximate variance চনত 200 
h 1 1 
(10.473): (200) Go: 
x (120.80.130.70 + 60.140. 50.150 + 20.180 .20.180 


+2.120.130.60.50 +2. 120.130.20.20 +-2.60 50.20.20 
= 0.0031. 


= 0:0038; 


Exact variance = 


(b) If tw is used, 


Approximate variance = en | 
RF = (G-5I5): 200 
X {81(0-05)(0-95) 4- 25(0:02)(0-98) + 81(0-11)(89) 
+ 9(0-05)(0-95) +25(0-01 )(0-99) + 9(0-06)(0-94) } 
= 0-004425. 
The exact variance is very 
It can be seen that th 
for smaller values of N 
variances will be larger. 


tedious to calculate, but it can be shown to be 0-004417. 
€ approximations in this case are very good. FOWeve ° 
» the difference between the approximate and exac 


6. SUMMARY 
The means and variances of the 


in measuring observer agreement, we 
to the variances were also given, a 
large number of observations these 


statistics, kappa and weighted kappa, usec 
re found. Easily calculable approximation’ 
nd a numerical example suggests that for 
approximations are very good. 
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DERIVING COEFFICIENTS OF RELIABILITY AND AGREEMENT 
FOR RATINGS 


By A. E. MaxwrLL 
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and A. E. G. PILLINER 
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1. INTRODUCTION 
When a number of Psychologists or 
Eroup of patients, using a Previously agreed schedule of sSympto. 


Surveys, and so forth, but they will 


ew the same Sroup of patients to assess the 
Presence or absence or the degree of Severity of (a) a single symptom; 


(6b) a number of Symptoms which may not be mutually independent. 
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II. A number of raters interview a single patient to assess the presence or 
absence, or the degree of severity of a number of symptoms, which again 
may not be mutually independent. (This is a special case of Ib, but 
it is informative to consider it separately.) 


Discrepancies between the results given by raters may vary In a ঠক কৰ শা কন 
Ways. For instance, in the case of Ia, there may be bias in the results in the 
sense that one rater may systematically record the symptom as being present 
more frequently or as being more severe than does another rater. There may 
be interaction between raters and patients in the sense that a rater’s biases for 
individual patients may be larger or smaller than his average bias over the whole 
group. Though fair agreement between the raters may exist, discernible 
patterns may be recognizable in the disagreements, i.e. it may not be legitimate 
to assume that disagreements, or ‘errors’ are randomly distributed (see Fleiss, 
1965). Possible methods of analysing the data to take account of different 
eventualities are discussed below. 


2. CONTINUOUS DATA WHICH CONFORM TO A TWO-WAY ANALYSIS OF VARIANCE 
MopEL 

In situations in which degrees of severity of symptoms are measured on a 
rating scale (e.g. Hamilton, 1959), or other ‘continuous ’ measuring device, 
Where ‘ scores ’ are allocated to each patient, an adequate analysis of the data, at 
least in Case Ia above, can generally be achieved by well-known methods of 
analyses of variance. In this approach, transformations of the data to achieve 
stability of variance (Bartlett, 1947; Box & Cox, 1964) may often be helpful in 
meeting the assumptions of the model. 

Taking up suggestions made by Jackson (1940), Hoyt (1941) used a 
two-way analysis of variance on a matrix of scores obtained by f persons on each 


of g questions. He showed that the correlation between the f totals over the g 
questions could be estimated by: 


=(P-PO)JE, 
in which P is the mean square for 
interaction mean square. 
‘“ prophecy ’ formula in rev 
individual question totals: 


"=(P-BO)I{P+ (g- DFG}. 


To illustrate the analysis, it will be assumed that t randomly chosen raters 


interview a sample of 1 randomly chosen patients and rate each patient on a 
scale measuring the severity of 


1 INE a single symptom, X (Case Ia above with a 
ft continuous variable). The results could be set out as in Table 1 where the 
score X;; in the body of the table is that given to patient i (i= 1, 2, ..., 1) by rater j 
(f=L 2.5). 


persons and PO the persons x questions 
It is an easy matter to use the Spearman—Brown 
erse and obtain for the average correlation between 
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TABLE 1. SCORES ALLOTTED BY t RATERS TO 7 PATIENTS ON SYMPTOMS X 


Raters 

Patients LU Biss fF = {£ঁ Totals Means 

1 Xun ee Xj EL Ku P; 2, 

2 

H Ki Xj Kit Pi Pi 

n Ha awe BH an HE Py Py 
Totals EE as Dj; .. Di Ft 
Means D,; D; nD ন 


A two-way analysis of variance of the data is shown in ‘Table 2. In this 
analysis the estimated mean squares, denoted by D, B and DE are obtained by 
dividing the sums of squares by their respective degrees of freedom. The final 
column in Table 2 gives the expected mean Squares for the several components of 
variation: oo is the residual or ‘ error ° Variance; c,2 is the variance of patients’ 
mean scores, while co f* is the variance of the mean scores for the raters. On the 
assumption that the variances of the columns of data in Table 1 are homogeneous, 
bias between the raters can be tested by the F-ratio D/DP. 


TABLE 2. TWo-WaAy ANALYSIS OF VARIANCE OF THE DATA IN TABLE 1 


Source of Degrees of Sums of Mean Expected 
variation Sreedom squares square mean square 
Raters t~—1 ny (BD; —T) D 00° + nog: 
[ 
Patients n~-1 (P;- Ty: BP 00° + top: 
n 
Residual (1-1) (1-1) by difference DP 00° 
Total nt—1 SD K;j-7T): 
tn 


A measure of the overall agreement between this random group of t raters 
and another random group of t raters is given by the ratio: 


2 toy |(to,2 ন 00°), 
which can be estimated from the mean squares as 
n=(P-DB)/E. (1a) 
A measure of the average agreement between one rater and another is given 
by the ratio 


P2= 0p |(0p2 + 002), 
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which can be estimated from the mean squares as 
ra=(B—-DP)I{P+(t- UDP}. (10) 


These 7’s are significant if the F-ratio PDP shows £ to be significantly greater 
than DP. 


3. CONTINUOUS DATA: THREE-WAY ANALYSIS OF VARIANCE MODEL—CASE Ib 


Consideration is now given to Case Ib in situations in which rating scales 
are employed. This case could be dealt with by performing separate analyses, 
of the type outlined in Section 2 above, for each symptom. But since the 
symptoms are unlikely to be independent of each other, an analysis of the data 
Which simultaneously takes them all into account is generally more informative. 
Therefore it will be assumed that t raters (J=1, 2, ..., 1) interview the same 1 
patients (i=1, 2, ..., n) and rate each on k symptoms (h=1, 2, ...,R). It will be 
further assumed that the rating scales for the several symptoms are similar in 
range and, for the purposes of the model, that all the raters replicate their 
ratings for each of the k symptoms m times (=1, 2, ..., m) in the investigation. 
The data are set out symbolically, for a single replication, in Table 3. 


"TABLE 3. SCORES ALLOCATED BY EACH OF t RATERS TO n PATIENTS ON Rk SYMPTOMS 
First Rater t-th Rater 

Symptoms 1 Esc YF SE 1 2a YF was 

Patients 
1 Xn oe Xin NE. < NE Yi, as Lh ae Yk 
2 
i Xin Yin 
n নললে 

Means Xx “ss Xj হজ Xi “as bl is Yn te Y.p 


A three-way analysis of variance model with replication is now considered. 


Tt is assumed that the score of patient i on replication gq of symptom h for rater j 
can be represented as: 


| Ninja= a+ Bit ya + 8+ By + BOs; + yO + Byding + emngo- 2) 
In this model, ais the general mean; B; the contribution specific to patient 1; Yn» 
the contribution specific to symptom Ah; 6; the contribution specific to rater J ; the 
next four terms in eqn. (2) are the interaction terms, while e.,.. is replication 
error. One purpose of the analysis would be to see i there jg ble 


bias between the 
mean scores allocated by raters, to see if some symptoms are scored more highly 


than others, or if there are significant interactions, etc. Another purpose would 
be to derive from the analysis coefficients of agreement among raters 


' 


* between raters. A distinct advantage of the mult 
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For the purposes of the analysis the most general case will be a 
which raters, symptoms and patients (referred to in the ) y the Ie I 
D, 5S, and P, respectively) are all considered to be random SaAp Ep 
populations. ‘The analysis of variance of the data would now হ Ee the orm 
shown in Table 4. For reasons of space, the ‘sums of squares "column is 
omitted, but these would be calculated in the usual way (e.g. Winer, br 
P. 171) or would more probably be obtained on a computer. In the fina 
column of the table, og* stands for the variance between Raticnts; 0,* for variance 
between symptoms; os for variance between raters. Variances due to কাৰ 
actions are indicated by o2 with appropriate subscripts, while oz stands for the 
‘error’ variance. After making appropriate assumptions about which. are 
‘random ’ effects and which are ‘ fixed’, the expected mean squares a “ 
the appropriate error terms to use for testing main effects and interaction effects. 


"TABLE 4. A THREE-WAY FACTOR ANALYSIS WITH REPLICATIONS 


Source of Degrees of Mean 
variation freedom squares Expected mean squares 
Patients (P) (n-—1) P 00° +mopy,s + mtogy,* + mkoss® + mhktog* 
Symptoms (S) (k-—1) S 00° + mopys® + mtogy,* + mno, s+ mtno,* 
Raters (D) (t—-1) D 00° + mogy,s® + mhkogs® + mnoys* + mkno gt 
DS (t-1) (k-1) DS 00° + mMopys* + mno,, 
DP (t~1) (n-—-1) DEP 09° + mopys® + mkogs? 
SP (k-—1) (n—1) SPF 00° + mopys* +mtogy* 
DSP (1) (R—1) (n—1) DSP 00+ mogys? 
Replications tkn(m—1) E 00° 


In a recent paper Fleiss (1966) considers a multivariate approach to the 
problem discussed above and provides a likelihood ratio test for assessing bias 
amongst raters. A criterion is also given for deriving measures of agreement 


ivariate approach is that it 


takes into account possible correlation between the variables. 


4. COEFFICIENTS OF AGREEMENT DERIVED FROM THREE- 
VARIANCE 


The use of information provided by the analysis of variance for deriving 
coefficients of agreement between raters is less well known than its use for com- 
paring means and this will be given in greater detail. Without loss of generality, 
the mean of the data can be shifted to the Origin, so that a= 0. From eqn. (2) 
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the total score for patient i over k symptoms each replicated m times and rated by 
t raters is Obtained by summing over J, hand g. Jt is: 


Xi= SYD xyna=himBi+ tm S yntkm S 6j+1m S, Byint km >, By 
jhg h 5 h 3 
+m 25 yd+m SS Bydnst ZXS ene © 
LE hj Ajo 


From eqn. (3) the expectation of the variance over i of Xi is: 
E(V)=Rt mop? + kt°m2og,* + ketm2og,> + Ritmo, + Rtmoo:. 4) 


Now suppose that the whole experiment is repeated on a second occasion, Ue 
+ the same symptoms, raters and patients. Then E(V) is as before, while t 
expectation of the covariance between the m repeats is everything in eqn. ডি 
except the final term, since errors, if random, should not contribute to the 
expected value. The expectation of the correlation between the Xi: values In t 
experiment and in the hypothethical repeat experiment is thus given by: 


2 2 
j= Rtmog* + tmog,* + kmogs* + moe, (5) 
Rtmog* + tmop,* + hkmogy + mMop,s + To 


If replication were possible, then all the variance components in eqn. « 
could be estimated from a set of data using the analysis of variance given in 
Table 4. But, in general, replication is impracticable so that m= 1 and a direct 
estimate of cy* cannot be obtained. Consequently, the second order interaction 
term DSP, which, if m=], is an estimate of (0° + op,s2), has to be used as an 
estimate of oy“. This is equivalent to assuming that cp, is zero—generally 2 
reasonable assumption. Eqn. (5) now becomes: 


E(p)= htop + tog, + kos | (6) 
htog* + tog, + hops + To 
° in which op* is estimated by DSP. The same result is obtained from Table + 
modified to correspond to the experiment conducted without replication. First 
the ‘replication ’ line is dropped. Next, all the m’s in the expectations are 
deleted. Finally, on the assumption made in the text that cs,2=0, this term ডি 
deleted from all the expectations. Eqn. (6) then follows at OLE 
An estimate of the coefficient of agreement between (i) the whole set of t 
raters over all k symptoms and (ii) the same set of t raters over the same 
symptoms can now be obtained directly from the means squares for patients 
and the second order interaction term DSP using the equation: 


n,=(P- DSP)/E. 0) 


Eqn. (7) is the algebraic equivalent of e 


d 
5 qn. (6), when the estimates of the expect 
mean squares derived from a set of d 


ata are used. 


mw———~——_———— > — 
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If an estimate is required of the average coefficient of agreement of a single 
rater with himself over & symptoms, then we put t = 1 in eqn. (6) and, after some 
algebra, obtain: 

_ BP+(t-1)DP-—iDSP (8) 
P+(-UDE 


ra 


Finally, if an average coefficient of agreement of a single rater with himself on a 
single symptom is required, then t=k=1 in eqn. (6). In this case the formula 


in terms of estimated mean squares becomes: 
P+(k—-1)SP+(t—-1)DP_(k+t-1DSE 


$7 Bi (hR—DSE+(0-—DDE+R—UD(C— DDSE 09) 


All the eqns. (6) to (9) relate to self-consistency—the agreement of a group 
of raters with itself, or of a single rater with himself. © It is perhaps even more 
realistic to ask: what is the coefficient of agreement between one group of raters 
with another group, or one rater with another rater? 

Returning to eqn. (4), let us assume that the experiment is repeated using 
the same Symptoms and patients, but a different random sample of t raters. 
Then E(V) is as before, while the expectation of the covariance between the 
repeats is the first two terms in eqn. (4), the other terms with 5's explicitly or 
implicitly in the subscript being excluded. Putting m=1, and again making 
the assumption that 0gy9 = 0, we Obtain: 


= htop: + topy* | 
Rtog* + top, + hogs + 00% 


E(p) (10) 

From eqn. (10) and the expectations of Table 3 we obtain at once an estimate 

of the coefficient of agreement between (i) the whole random sample of ? raters 

Over all k symptoms, and (ii) a further random sample of t raters over the same 
Symptoms. It is: 


rs=(B—-DB)/E. (11) 


An estimate of the average coefficient of agreement between one randomly 
selected rater and another over all k symptoms is obtained after putting t=l1 in 
eqn. (10). In terms of mean squares this estimate is: 

rs=(P-—-DP)[{B+(t-1)DP). (12) 


Finally, if an estimate is required of the average cocfficient of agreement 
between one randomly selected rater and another over a single symptom, we 
put t=R=1 in eqn. (10). In terms of mean squares this estimate is: 

Ee B-DP+(k-1) (SP— DSB) (13) 
a ৰ — 
B+(kR-1)SP+ (t—-DDP+(k-1) (t—-1)DSP 
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5. THREE-WAY ANALYSIS OF VARIANCE MopeL—CaAse II a 
নং atients 

Case II will be Considered in the first instance for t raters, n Ca CBE 

symptoms and mm replications and then n will be made unity to in (2) া 

Solution required for a single patient. The basic equation is again eqn. Taiag 

the roles of ‘ patients ’ and ‘ Symptoms ’ will now be reversed and, by তা: fe 

over j, 1 and g, the total score Xn for symptom h obtained. This leads 0 

following expression for the expectation of the variance of the Symptom tota!s, 

namely, 

£E(V)= thm, 2} Imno, 2+ tmeno,, 

If the hypothetical re 

random sample of t rater 

first and third terms in 
agreement between the 

Inmo,2 + mop, 


E(p)= (15) 
(0) inmo,* + tmop,2+ mno,g* + mos, 2+ co’ 
which with m=1 and os, 


2 Im°no,, + tmnop. (14) 


Peat of the investigation is taken to involve a a 
S, the expectation of the Covariance is the sum of t 


eqn. (14) and the expected value of the coefficient of 
two samples of t raters is: 


2 =0 gives the mean Square estimate: 


r= (5 DS)/s5. 
For the particular situation in which t raters rate a 
symptoms, m=1, n=1 and 09,5 =0 in eqn. (15). 
coefficient of agreement between raters 


(16) 
single patient on hk 


‘The expected value of the 
now becomes: 


a Ho, + py) 
o-oo 
Which can be estimated by 
S_DS+ (0-1 (FED 

r= 8 DS+(0 ) (SE SP). og) 
S+(n-1)SP { 
€ rater rates a 
sys =0 in eqn. (15). 
Ween two randomly sel 


single patient on k symptoms 
The expected value of the co- 
ected raters is now: 


2 2 
Eto) 0, Tt 05, 19 
(0) 0 Ho 08, + 0, বণ oy ’ ‘ ! 
for which the estimate is 
r=  S+D5+ 0-1) (SE_ DSB) (20) 

+0 1)SPL GDS 1) (— 1)DSE' 

If only one patient is available for t 
be used in Place of the Previous three 


two-way analysis must 
be of the form depicted in Table 1 wi 


This restricted analysis will 


th ‘Patients ? replaced by ‘ Symptoms ’. 
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In this case, the expected val 


ue of the coefficient of Agreement between two 
random samples of k raters is: 


= to, টি 
£(p)= io? (21) 
Which can be estimated by 
S'_ DS’ 
Uae Re (22) 
ৰ 


and DS" are mean s. 
coefficient of 
Obtained by putting t=1 in eqn. (21). 


quares in this two-way analysis. 


domly selected raters is 
The expected Value is: 


2 
Eo) (23) 
hich can be estimated by | 
ib 1 Ee (24) 
5+ (1-155 


ious three-way 


(19) are con- 
hus y= (0,24 09,2) and oy'*= 


£( Vr) = m2k2712 


‘This expectation is, 
for Din Table 4. 


008 + mhhn2o, 2 + mn2h2no, 2 


otal is mhno 2, 
similar argument to t 


H » Eis taken to be the 
interaction mean square. 


Symptoms, the expected value of 
the error variance Of a rater total becomes 

(m*k2no, 2 + m°knoy, 2 + mknoy2). 
The test for rater bias is then F=DI/DEP. 


5.P. 
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jee 30 টী » same t raters 
(iii) If the investigation is assumed to be repeated over tie ke luce of LU 

the same 1 patients, and a different sample of k symptoms, the expe 

the error variance of a rater total now becomes: 


2 
(m*knto, 2 + m2knog, ps2 + mknoy), 


and the test for rater bias is F= D/DS. 


i i igation i ্‌ » same t raters, a 
(iv) If the investigation is assumed to be repeated over the same tr , 


ন নং ্ >» EX ected: 
different sample of n Patients, and a different sample of k symptoms, the exp 
Value of the error variance of a rater total is now 


2 
(m°ktno, 2+ m°kn2o, 2 + m2kn 0gys + Mknoy). 


The test for rater bias is F= D/(DS + DE DSP). The degrees of freedom ar 
troublesome, but if n and k 


) k ionificance of 4 
are fairly large, a decision about the significance of . 
would be sufficiently clear-cut. 


Ti DicHoToMousLy ScoRED Dara 
Consideration is 


now given to Situations in which rating scales are not 

employed but the raters simply indicate whether a symptom is present or absent, ! 

scoring these 1 or 0 ethod of dealing with Case Ia is tu | 
ABT fie well-known Kuder formula KR 20 (DuBois, 1965). 

Uppose t raters (=1,.2; 52, t); ea tients for the sam 

symptom. The Proportions rat. ) Tp i 


h Over all t raters is $2. Then an estimate of the 

en these patient totals a ling totals over a 
n in 

Parallel set of ¢ other raters is ETE eoemeponing 


Ay analysis of variance in which ৰ 
aters ’ and the variate consists of 1's an 
2 particular symptom. 

. i e 
ficient alpha ’ (Cronbach, 1951) in a, 
n the Present context, writing W for hat 
Variance ‘ within raters ’, it is found © 


t 
0 W)/B= t 


Omous rati 
mean square and sp ng. I 
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The Kuder-Richardson formula cannot be employed for dichotomously 
" scored data falling in Cases Ib or II. In these cases the analysis of variance 
methods outlined in Sections 3 and 4 above can be used, provided the eS 
tions underlying the analysis are reasonably fulfilled. In particular, analysis চর 
Variance models would be apposite when the incidence of the symptoms, 
recorded by each rater, is within the range 20-80 per cent for then the be 
$ of the symptoms would meet the criterion of homogeneity fairly adequately. 
ন More recently, Case Ia (and, by repeated application, Case Ib) has I 
discussed by Armitage et al. (1966) and by Fleiss (1965). | Armitage et al. a lopt 
2 Straightforward mathematical approach to the problem in terms of Proportions 
") and Propose three indices of Observer agreement or disagreement, any one ডে 
Which might be used. They are: (a) The mean Majority Agreement INER, 
Measuring the frequency with which observers agree with the majority opinion; 


b) The mean Pair Disagreement Index, measuring the frequency with which 
Nairs of observers disa 


en subjects in the number of observers. Use of 
nd to yield similar conclusions; (b) and (c) are 
d. ‘The authors add the cautionary note that ‘ the 
ed for any index depends strongly on the proportion 
Whole study’. Fleiss (1965) feels that an attack on 


3 


ying such judgements. Using the model, 
Tr which ‘ Cochran’s O statistic is valid for 
atic differences amongst the Judgements of 
the different Observers’. He also Provides a measure of intraclass correlation 


between all pairs of Judgements on the same subject which may be used as a 
measure of concomitant agreement between observers in cases Where systematic 
differences between them are small. 


YPotheses of no system 
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SIMULATING PERCEIVED SIMILARITIES BETWEEN TASTE 
MIXTURES HAVING MUTUALLY INTERACTING COMPONENTS 


By R. A. M. GREGSON 
University of Canterbury, New Zealand 


Empirical findings about the perceived interaction of the components of 
qualitatively complex taste stimuli are reviewed and summarized. ‘The revision 
of multidimensional scaling equations to incorporate a simulation of a mutually 
inhibitory or facilitatory interaction between components is discussed. Data 
from a previously published study on the perception of similarity among taste 
mixtures are re-analysed for illustrative purposes and a description of them is 
shown to be less adequate when provision is made for one form of interaction than 
when this is omitted. A reclassification of stimulus-response paradigms for 
studying taste mixture perception is outlined; and it is suggested that the inter- 
action of taste components may be an artifact of the response measures used. 


1. INTRODUCTION 


When a mixture of qualitatively different tastes is experienced by a human 
subject, it is sometimes reported that the components of the mixture interact. 
For example, if sucrose of a given concentration is present, the perceived inten- 
sity of sweetness of the mixture may be more, or less, than would be the case if 
the sucrose were in a simple solution with no admixture of other components 
which, taken singly, elicit qualitative sensations other than sweetness. Inter- 
action, in this context, is thus an approximate psychophysical analogue of 
interaction in analysis of variance models, in the sense that the set of qualitativel 
distinct intensities resulting from a complex stimulus cannot be predicted Fo 
a linear sum of the intensities which result when the components of the complex 
stimulus are experienced one at a time. p 

The problem has been of recurrent interest to workers in physiolo 
psychology and food technology, because the phenomena of sensory interaction 
between taste components are the stumbling-block for any theory of taste, and 
particularly for any multidimensional scaling models of taste perception which 
aim to map more than a limited part of between-judgement variance. ‘The 
measurement theory problems which arise when an attempt is made to represent 
multidimensional systems involving interactions are not, of course, unique to 
taste. However, in the case of taste the problems are more obvious and perhaps 
more acute. 

Empirically, the problem of taste mixture perception has been studied 

iecemeal for some time (Bujas, 1934; Fabian & Blum, 1943; Beebe-Center 
1959; Pangborn, 1960, 1962; Kamen et al., 1961; Gregson & McCowen, 
1964; Gregson, 1966 a; Indow, 1967). Many different chemical 
f response measures and psychophysical paradigms have 


et al, 
1963; Békésy, 
stimuli and a variety 0 
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been employed. The results are not consistent from study to es Aegon 
due to methodological differences in stimulus presentation ins Ee ne 
subjects may be restricted to one particular form of report bar lois Et 
qualitative and intensity changes of one sensory component in a mixture. 
absolute and relative intensity ratings have been used. | CO sf 
Consistent and reliable differences between subjects in the পপ কে 
sweet-acid interactions (acid making sucrose more sweet or less swect) le 
been reported in more than one of the cited studies; results on other stimulus 
pairings are less clear-cut. Similar directional differences apparently ন) 
depend on a change of method from eliciting absolute ratings of the ET 
intensity of a taste mixture to eliciting relative intensity ratings of that SAPO 
against the same qualitatively identical concentration in an unadulterate 
mixture. The diversity of results suggests that the specification of response 
Constraints, or the related and equally difficult specification of what the subject 
is to judge in the task, is likely to be as critical as the description of the stimulus. 
The earlier studies have been extensively reviewed by Amerine, Pangborn & 
Roessler (1965) but little, if any, attention has been given to the psychophysics of 


the rating task as compared with the amount of speculation there has been about 
the chemistry of the recep 


tor cell environment when complex stimuli are 
employed. 
However, the general finding, possibly subject to minor qualification for 
some taste pairings, is that the prese 
perceived irtensity of some other co Yy itself a qualitatively 
distinct sensation. hate to sucrose both 
ixture less sweet than 


t. (There is a pheno- 


em into one bipolar sensation. This fact 
theory of taste.) ‘The extent 
f the inhibiting component in 


the inhibition can be approxi- 
mated by a positive monotonic function of the ratio of the concentration of the 
inhibited to the sum of the inhibiting 
metric. 


components, given an appropriate response 
This relatively simple ty 


Conveniently referred to as 
considered. In all Pps 


isomorphism betwee 


if mutually inhibitory 1s 
is not the only type which has to be 
ychophysical representations of taste perception a partial 
bstances, most acids, some salts, bitter 
) pli Crose or glucose, hydrochloric or citric 
acid, sodium chloride, Caffein or quinine sulphate }) and {sweetness, sourness, 


Perate at least over any range of concen” 


far niade no explicit provisi 
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2. MULTIDIMENSIONAL REPRESENTATION OF INTERACTIONS 


Interaction could arise as readily from adaptation or frame-of-reference 
effects in the judgement process (Gregson & Paris, 1967) as from any receptor 
cell events. But, whatever its causal locus, provision has to be made for its 
possible occurrence in any psychophysical model of taste perception which 1s 
claimed to be valid. Indeed, by using multidimensional scaling, it might be 
thought easier to arrive at a general assessment of the form or magnitude of 
inter-dimensional inhibitory effects than would be the case from conducting a 
series of experiments in which each possible pair and trio of interactions was 
studied separately. Given sweet, acid, salty and bitter as sensory dimensions 


there are ten such subsystems, (z2)(3). neglecting differences between 


stimulus substances eliciting the same primary sensation. 

Whilst the near-threshold region can usually be avoided in simple uni- 
dimensional psychophysics, it is almost certain to be involved in any stimulus 
series employed in multidimensional scalings on at least some dimensions; 
without such weak intensities on some dimensions in some of the mixtures 
employed, an even spacing of mixtures in a stimulus hyperspace is unlikely to 
be achieved (Gregson, 1965). The locus of the effects of near-threshold 
qualitative changes in the multidimensional scaling experiment could be pre- 
dicted if more extensive data on this breakdown were available than is the case 
at present, because such experiments normally employ a complete paired 
comparison matrix and not all the cells in such a data matrix are equally suscep- 
tible to departures from the predictions of a scaling model which fails to make 
Provision for the breakdown of sensory isomorphism. 


Multidimensional scalings of taste mixture similarit 


Y perception (Gregson, 
1965, 1966 b, c; Gregson & 


Russell, 1965 b; Russell & Gregson, 1966) have so 
on for interaction. This paper describes a model 
with such provision, and discusses the results and problems arising from its use. 

Let a taste stimulus x have t dimensions B=, 2, act }, and let the physical 
stimulus magnitude on the ith dimension be x; for the mixture xeM, where M 
is a set of m taste mixtures between which comparisons are to be made. The 
total sensory input for x is then the set hs 2s, 


ee3 Nis cee) KE} 
" Tf the input is of the form {0,.0; ses 


| » Xi, 0, ..., 0}, that is to Say a mixture 
with one non-zero component, then the response—a 


be Yri(xi), where yi is the operator effecting the unidimensional Psychophysical 
transformation tothe sensory dimension which corres i 
sioni. Various 4 may be merely scalar multiples of each other; e.g. if more than 
One Xj 1S a salt, the relevant Yj are all transformation. 


oy S to ‘saltiness » which are 
known to be empiricall 


y similar (McBurney & Lucas, 1966). 
following Stevens (1961), and using io as a zero lev 


Yili) = oil; —i0)si, 
where « and s are psychophysical constants. 


In general, 
el symbol for dimension i, 


01) 
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2%) > 0, 


J(x*,) < Jilx) 
(If s=2 this Inequality may not be Correct, but it has Never been found to be 
higher than 1.3 in Mpirical studies.) 
Following earlier studies and results (Gre 
lished data i 


by Russell), the Psycholo 


son, 1966 b,c, d: 
et 
€ represented to a fi 
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its face value. A term of the following kind is introduced: 
Blbuffering components [buffered component], (4) 


where Bis a constant representing the extent of buffering effects. The buffering 
components are to be represented by the sum of their effective intensities. 
Expression (4) will be regarded as subtracting from xi and yt, and becomes an 
additive but regular component summating with the effects of the «i. The xi 
are being thus reduced by three conceptually distinguishable additive effects: 
(i) the unidimensional detection threshold, (ii) a buffering due to effects which 
can be expressed in the general form (4), and (iii) residual buffering effects 
specific to individual dimensions, over and above those accounted for by 
expression (4). ‘The two effects (i) and (iii) are represented by the «i. 
Eqn. (4) can be rewritten as 


t~-1 
Bxi= Boar (ni a) > (x7 — 3), (5) 
j= 


where ji. Byiis similarly defined. In eqn. (5) the notation indicates sum- 
mation over t—1 dimensions excluding the ith, and not summation up to the 


(t—1)th dimension. 
Combining the Bxi and By; terms ine 
interaction equation to be considered here: 


Jo, yp)oD™(,y; (th) 
t ur 
=| y (a | (1-0) (mi a— Bi) —(yi- a—Byi) ] f f (6) 


i=1 
which is in (31+ 3) unknowns and predicts all the m2 comparisons over an MxM 
experiment. Hence, for an adequately determined solution with enough 
degrees of freedom, m2 > (3t+3), which is readily satisfied. ‘To find a solution 
set of parameters for a, B, 6 0, 5, and r for a given experiment, it was decided 
first to solve eqn. (3) iteratively, which is, in fact, eqn. (6) with B=0, and then 
gradually move B away from zero in either direction. This strategy exploits the 
fact that interaction effects reported are definitely second-order. If they were not, 
rong risk of getting a different solution this way from that ob- 
tained when proceeding directly to solve eqn. (6) by iterative methods. Thereisa 
problem here, in any case, for the form of eqn. (6) does not exclude the possi- 
bility of multiple solutions to the data matrix matching by virtue of some sort 

and 0 and e together, 


of “trading relation ’ between the effects of parameters B, { 
and it seems that only by using psychophysical knowledge about the relative 


expected magnitudes of the effects mapped into eqn. (6) can the relative merits 


of two solutions be decided. 
Two comments are pertinent: in some 
constant sum for all xeM, a precaution 


ation effects in the scaling task. | : 
nd the high x; lightly inhibited. This in turn IMP 


qn. (3) yields the fullest form of 


there would be a st 


experiments the xi have an approxi- 
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Yilxi) rises more steeply, asa consequence of inhibition, than it would TOErWIsE 
This in turn demands representation in eqn. (3) by a higher Minkowski Ys 
So, in a mixture set with fixed constant effective overall intensity for each mixture 
ve B term should be present, 
» in part, by a raised r. Conversely, if we 
» 7 Should drop to compensate in order to 
This excludes testing eqn. (6) against 
aint on the total intensity of mixtures 


maintain a good fit to empirical data. 

data from experiments Where the constr 

has been effectively applied. 
Secondly, the assumption that B 


is a constant over all 1, J dimension pairs 
is a simplification. If a different v. 


alue is assumed for each 1,] pair, in each 
direction of inhibition (iuponj, or j upon 1) then the problem becomes intractable 
because (12 _ 1) unknowns are added and iterative solutions become interminable 
and untrustworthy. More simply, B might be replaced by a product term 
Bix Bj, where the first constant is associated with the inhibited component and 
the second with the inhibiting component. ‘This possibility is not developed 
here because the available empirical evidence does not Justify the effort in the 
Context of taste, but it is advanced because it might be worth considering in 


intersensory studies, as opposed to interdimensional studies of interaction 
within a modality. 


It is here Proposed to solve eqn. (6) by putting 


H=Iy(w, y)— De, y)]2 


0) 
ry value for H, by a Monte Carlo method. 


Such a value 


and finding a stationa 
is reached when 
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of the Weber fractions on the four taste dimensions involved Over the m2 paired 
comparisons. These values will be predicted to be nearly equal, and have been 
found to be so (Gregson, 1966 4d). The product of the a; is a function of the 
volume of space in which the solution to eqn. (7) is contained ; the limits of this 
volume are given from 


t 
1<TI ais [a*|8, 
i=1 


as shown by Sebestyen (1962). 

Stationary values of H generally occur on a mesa (the surfac 
topped mountain) as Minsky (1961) has Previously observed. 
of constant H-altitude has a perimeter given by the tolerance 
a, 6; ,B,r. The Vector {wa;, w0;, we;, -} (1=1, . 
Surface about the final adopted solution. The Partials of eqn. (8) are slopes of 
ascent to the mesa, and zero on the mesa itself. Fore 
—%Ww'a; to «i+ wai, where w' and w"’ (lower and upper 
tolerances) are not necessarily equal; Wa; summarizes this state of affairs. 
mesa surface for min H in the Parameter hyperspace 
ee ways, neglecting 
the vector: (i) no difference, and B=0; (ii) the altitude 
are moved, and BZ0; (iii) the altitude Stays the same, 
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resultant matrix of 5X 8 =40 cross-modal inter-stimulus distances was generated 
by 16 subjects in each of four conditions, described in the original paper. “The 
similarities between taste mixtures and the mixtures represented by the histo- 
grams were rated by the subjects on a scale from 0 (‘ identical ’) to 8 (‘ impossible 
for the two mixtures to be the same ’). Considering one condition only, that 
designated High Sucrose, Unpaced Matching in the original study, the following 
results relevant to the present paper were obtained. 

The overall mean similarity on the scale, over 40 comparisons, was 4-49; 
the variance of the 40 cell means in the Mx M’ data matrix was 0-263; and 
the estimated reliability of the 40 cell means was 0-563. 

The variance of the cell means is lower than in some unpublished experi- 
ments conducted more recently, and greater than in one other condition in the 
original study from which the data are taken. It varies with response constraints 
such as pacing; its absolute value is not informative as subjects do not effectively 
use all the response information available to them. 

The best fit of eqn. (3), modified for the cross-modal case by subtracting 
a threshold ee, from each xi value and solving for all ei, and putting s=1, gave a 
product-moment correlation between the observed and theoretical similarities 
matrices of 0-807, with 25 d.f. Parameter values selected, all understood to 
have a tolerance of about +01 for a and é, +0-02 for 6, with Minkowski 7=2"0 


were: 
«1 =18 a= —24 9,= —0-16 
a2= 08 €2=04 9,= —0-08 
us=10 3=02 9; = 0-03 
as =09 a=0"4 9, =0-06 


The negative 0, and 0 values are unexceptional; time order effects in taste are 
variously positive or negative. 
Eqn. (6) was modified for the cross-modal case by putting the values of 
Byi and the ei subtracted from yt both equal to zero. In the cross-modal case 
the yi values always refer to the stimulus presented visually; and it is assumed 
that there is no interaction between the histogram columns and no threshold 
effect which would modify their perceived relative Jengths. 
The amount of variance accounted for by this scaling is not very high 
par with resujts in psychophysics. The extent to which 
the variance of data can be accounted for by scaling models seems to be reduced 
as the number of dimensions goes up, and is probably always low for the chemical 
senses. Similar findings in this respect have been obtained for more conven- 
tional multidimensional procedures by Ekman & Engen (1962). 
The solutions to the equations for the 40 cross-modal comparisons were 
calculated on an IBM 1620 computer. ‘The parameter values which were 
explored with the o, e and 6 values given above are set out in Table 1. It is 
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3. DIscUsSION 


Whilst taste component interaction is an established empirical phenomenon, 
no support has been found, when scaling overall similarities between mixtures 
(as opposed to relative intensities of single qualitative dimensions in mixtures), 
for a scaling model which explicitly makes provision for a specified kind of 
mutually inhibitory or facilitatory interaction between components. On the 
contrary, the best fit recorded employs parameter values which presuppose no 
interaction. Either interaction is unlikely, or the model does not appropriately 
represent interaction. 

This finding raises a number of crucial problems about the application 
and interpretation of multidimensional psychophysics for a sensory channel 
which has the properties characteristic to taste. ‘These problems are perhaps 
best exemplified by considering some alternative stimulus-response relations 
which could mediate taste mixture perception. (In each case the relation of 
sweetness to sucrose is taken as a paradigm.) Table 2 lists the cases which it is 
convenient to distinguish. 


TABLE 2 
Effective stimulus Response recorded 
(i) Absolute sucrose concentration. Intensity of perceived sweetness. 
(ii) Sucrose buffered at the receptor Intensity of perceived sweetness. 
level by some other substance. 
(iii) Sucrose concentration relative to the Intensity of perceived sweetness. 


total stimulus concentration, assuming 
cross-stimulus intensity comparisons 
mediated at the receptor or peripheral level 
to be psychologically meaningful. 


(iv) Sucrose concentration, and the Sweetness intensity judged relative to the 
concentration of each other component total sensation level, either (a) as 
present, regarded as superimposed but mediating processes for a final ratio 


non-interacting excitation patterns at the judgement or (b) as a final absolute 
peripheral level. perceived intensity. 

(v) ‘The trace of the sensory response to 
the sucrose component on a first 
presentation, and the response to the 
sucrose component on a second 
presentation, both centrally stored. 


Perceived relative intensity based on a 
sequential comparison of the two 
sensation levels centrally stored. 


Cases (i) to (iv) are appropriately studied by an absolute psychophysical 
method. Case (v), which will also involve one of the procedures (i) to (iv), is 
appropriate for treatment by psychophysical methods involving relative judge- 
ment. Case (v) is also appropriate for description by the multidimensional 
similarity methods associated with eqns. (3) or (6), if it is assumed that the process 
whereby dimensions are pooled into the overall similarity response is independent 
of the interaction process. In eqn. (6) this assumption is made, for pooling is 
described by the parameter 7, and interaction by B. As no constraints are 
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Gregson (1966), based on the earlier work of Torgerson, Shepard, and Kruskal, 
the raw data have been similarities which are only invariant up to a linear 
transformation. 


4. CONCLUSIONS 


The conflicting evidence which has been discussed does not make interaction 
implausible, but it does suggest that interaction as usually reported (psycho- 
physically and not physiologically at the receptor mechanisms) may be an artifact 
of the response measure selected. The non-linear relations between psycho- 
physical transformations (Gulliksen & Messick, 1960) have been virtually ignored 
in comparative surveys of taste interaction evidence, though they have been 
studied for a unidimensional taste (Ekman, 1961). Until these are more clearly 
understood, neither the existence nor the form of taste dimension interaction 
can safely be assumed. 
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NOTES AND CORRESPONDENCE 


NOTE ON A SIMPLE FIXED-POINT PROPERTY OF BINARY MIXTURES 


By J. C. FALMAGNE! 
F.N.R.S. University of Brussels, Belgium 


Definition. A (binary) mixture of two distribution functions, H and K, is a function F 


defined for every real x by 
F(x) =PH®+(1 —-p)K(S*) (0<p<l). 
Obviously, Fis a distribution function. A simple fixed-point property of binary 
0 decide whether a family of empirical distributions could be 


mixtures can be used as a test t 
explained by a two-process model characterized as follows: any distribution in the family is 
the realization of a mixture of two constant distribution functions. Thus to each empirical 


distribution corresponds a value of the parameter p in the definition. 


The following lemma is obvious. 

Lemma. If H and K are defined as above, and if for some number xo, H(xo) = K(xo) = & 
then any binary mixture of H and K has value « at xo. 

Theorem. Let F; and F's, with parameters pi and ps respectively, be two mixtures of 
the distribution functions H and K. If pips and if Fi(x0) = F2(x0) = & for some Xo, then 
any mixture of H and K has value « at xo. 

For example, if F; and F's are continuous, and if their graphs cross each other at a point, 
the graph corresponding to any mixture of H and K must pass through the same point. 

Proof. By the hypotheses of the theorem, 

biH(xo) + a —PIK(xo) =paH(xo) + d= ba)K(xo) 
and thus 
[H(xo) — K(xo)I(pi— 2) =0. 
We conclude that 
H(xo) = K(xo) if Di#Ds- 
‘The lemma establishes the theorem. 

The interest of this theorem from an empirical standpoint is that, even though Hand K 
are unknown, it enables conclusions to be drawn from an examination of the mixtures. 

It is natural to raise a similar question concerning the mixtures of three or more 
distribution functions. As might be expected, more freedom is obtained in this case. 
For example, suppose that F; and Fs are two mixtures of three basic distribution functions 
G, H and K, defined by 

F(x) =DiG() +p HQ) + (1 — pi — DOK(S), 

F(x) =b2G(x) +p HO) + (1 —D2—D2)K(S), 
where the p’s are strictly positive, and pit pi < 1. If, for some xo, Fi(xo) = F2(x0), then the 
following equation can be derived 

(pi— PMG) — K(x0)] + (py — pa L(x) — K(xo)]= 0; 
which does not seem to lead to any result of practical interest. 
Finally, it should be noticed that the property expressed by the theorem holds for 

density functions and, more generally, for probability functions defined for an arbitrary 
space (with, in these two cases, the obvious definition of a mixture). 


1 At the University of Wisconsin during 1967-68. 


II S.P. 
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BOOK REVIEWS 


Logic of Statistical Inference. By IAN HACKING. Cambridge University Press. 1965. 
Pp. ix+232. 40s. 


Statistical inference is a subject in which there are almost as many distinct points of 
view as workers in the field. Professor Hacking introduces here a new contribution. 

‘The first question to be asked about such a contribution is what exactly the author 
means by ‘ statistical inference’. I think it is fair to say that there are two main ways in 
which we may be influenced by statistical information. It may change either our opinions 
or our decisions. Of course, these two aspects are interrelated. Butin the case of decision- 
making we have at least some reasonably objective criteria to guide us. It may sometimes 
be possible to show that one decision is clearly better than another, in the sense of leading 
to greater financial profit or other desirable ends. Professor Hacking makes it clear that in 
his book he is not concerned at all with decisions, but only with beliefs. ‘This is a much 
more thorny question. How are we to know whether one set of beliefs is more correct or 
appropriate than another? If someone asks us, ‘ Who are you to tell me what to believe? 
how are we to answer him? What sort of arguments can we put forward which will persuade 
him that we are right? But if statistics is to help us to come to judgements at all there 
must be some way of proceeding from the data to our conclusions. And anyone who wishes 
to do more than just blindly follow statistical procedures will have to think about this 
question seriously. 

Itis interesting to know what a philosopher and logician will make of current views on 
statistical inference. ‘The book takes into consideration all points of view and topics 
related to the subject. It deals with the definition of probability, sampling, the minimax 
rule, likelihood, hypothesis testing, significance levels, sequential tests, fiducial probability, 
estimation, Jeffrey’s theory, and the subjective theory. ‘The discussion is always tolerant. 
Professor Hacking makes every effort to understand other authors’ theories. It is penetrat- 
ing: where objections are made, they are fairly and clearly made. It is often amusing, and 
enlivened by historical information. The style is lucid, and there are not many technicalities, 
either philosophical or mathematical, which are likely to impede the reader. However, the 
book is written in rather a tentative and exploring spirit, and does not lead to any precise 
conclusions of immediate practical value. Occasionally, one may feel that Professor 
Hacking has allowed himself to be persuaded into a rather odd point of view, although such 
lapses are rare. ‘Thus on pp. 48-50 he considers the following situation (admittedly an 
artificial one, but introduced for the sake of argument). One knows that in a large store 
there are only three different kinds of urn, which contain respectively (a) 99 green balls, 
1 red, (b) 2 green, 98 red, (c) 1 green, 99 red. One is given a large number of urns to 
examine, and they are ‘ produced from different and independent manufactories ’ so that 
« there is no reason to suppose the characteristics of one urn or machine are indicative of 
those of any other’. Furthermore and very importantly, though Professor Hacking does 
not say so explicitly, let us suppose that whoever produces the urns for examination is not 
deliberately trying to catch one out. From each urn one is allowed to select (at random) 
one and only one ball, and then one has to guess which type (a), (b) or (c) the urn belongs to. 
with the sole object of guessing right as often as possible. Common sense would SU gest 
some strategy such as guessing (a) when a green ball is drawn, (b) or (c) when a red ball is 

drawn. ‘The minimax strategy is very different: one takes a (very nearly true) coin, tosses 
it, and guesses (6) if it comes heads, (a) if it is tails and the ball is green, (if itis tails ane. the 
ball is red. Professor Hacking appears to say that this policy is not silly in the long run 
although he would not himself use it; and he regards it as better than the more SPuios 
¢« commonsense ’ rule. But he would regard it as silly if only one urn was being examined 
It is not at all clear why Professor Hacking reaches these conclusions, which I would Haye 
thought clearly showed the strange behaviour of the minimax rule. 


134 Book Reviews 


Professor Hacking’s special contribution 
least as well as d Supports Ah’ (which he Write: 
€, i, d, hare‘ Propositions’. The relation h 

> td, 


tat. E roids possible 
lies i, then hfe<ife. He claims that such a qualitative relation avoid P 
implies i, 


i itati degrees of 
difficulties which might arise When we try to give a quantitative measure to 5 
IFICU ৰ 
Support or degrees of belief. It has, howeve: 


ince it is onl 
er, the disadvantage that ls Ll ৰ 
i imited rar 
qualitative, it would seem to be usefully her limite 
situations. 


aad eld 

is an interesting idea, but that Progress will rE jt 
Up until the central Problem of what ¢ beliefs >, Opinions ° and TEE eh 
নী erly tackled. These Words indicate something Boing on in the mind: whs 

Prop Bp nN as well as a Philosophical one. fe end 
extremely difficult One, but I doubt if the tepped.. Philosop TBs i 
© appeal to criteria such as « the ordinary English use of the word “ belief d to the 
dangerous. Ordinary language i. }; very badly adapts ne 
; । tying to see electrons throug ! 
, Will keep moving about. For i 
i e practical consequences lu 
Seems more likely to yie 
Are appropriate on given data. Bu 


s if teasing, Problem, and Professor Hacking’s 
imulating contribution. CrpRic A. B. SMITH 


By Ever W. Bern and JEAN PIAGET. 
Ordrecht, Holland: D. Reidel. 1966. 


and, in essence, 


Intuitioyysy, Beth ক 
Psychology, and decisive ly 
of Aristotle, ells ogy so the logicist tradition, fron, Leibniz to Russell, 
of GodePs theorems, sentzer’s 0 
Semantic tableau 


eductive Sciences, a Semi-technical account 


eS, 
The latter js jpn Of natural ded 
to the problem of Hh eT ls introdyc, 


eduction, and Beth’s own technique of 

aracterizins ৰ d Primarily to demonstrate One possible solution 

have Le difficul: With this chapter. ematical] re fing, The logically naive reader may 
After the foundations B ্ 

underly mathematica] creation Le more Psychologica] Matters: the Processes which 
beliefs of Poincaré i S discussio, 1S mainly limited to the well-known 
(Pp 91): “যত Phase aration ন 0es Ay very mm h 
conscious work during this Period is c.  anot le. 
is a truism 


uch of interest, He writes 
ad to Profitable k, unless 
' 1S Carrie out i ৰ 2 WOT দর 
, Since no ndepengey definition oF ত fective ways ‘This, alas, 
a y €1 i 


is forthcoming. For 


KG 


NS 


Book Reviews 135 


Beth, the key to understanding mathematical creativity would be the construction of a 
typology of mathematical thought. This may be so; but it is a cause for regret that the 
collaboration failed to advance one. The final chapters of Beth’s section are about the 
role and nature of mathematical intuition, and the relation between ‘ thinking machines 
and mathematical thought. 


For Piaget, the fundamental problem is to discover a psychological basis for abstract 
logic and pure mathematics. But, just as Beth rejects “ psychologism ’, Piaget rejects the 
idea, notably Marbe’s, of introducing a logical factor as part of a psychological explanation. 
He then argues, however, that although the two disciplines are independent, they should be 
co-ordinated within ‘ genetic ’ epistemology, which traces the successive stages of mental 
development. He claims that intelligence is ultimately based upon the interiorization of 
actions originally performed on objects in the physical world, and which become co- 
ordinated into ‘ operational structures’. ‘These ideas are central to Piaget’s theorizing and 
are worth examining in more detail. 

A key concept is that of a reversible operation: an operation which annuls or negates 
the effect of a previous operation. In the early stages of operational thought, Piaget 
discerns three types of structure: those where the reversible operation is inversion or 
negation, involving classes; those where the reversible operation is ‘reciprocation ’, 
involving serial orders; and those whose elementary forms are of a topological character. 
Piaget is impressed by the correspondence between these three types of psychological 
structure and the structures which Bourbaki has postulated as fundamental to mathematics. 
But the correspondence is by no means exact: the psychological structures are what Piaget 
calls ‘ groupings ’ (groupements)—primitive group-like structures which contain a number 
of restrictions preventing free combinatoriality. ‘The ability to use natural numbers owes 
its origin to a ‘ synthesis ’ of one grouping involving classes with another involving serial 
orders. ‘The attainment of hypothetico-deductive intelligence, however, involves not just 
a ‘synthesis ’, but an actual combination of structures so as to contain both types of 
reversible operation. An individual at this stage of development (11 to 12 years of age) is 
operationally at home within the propositional calculus. He now possesses the inverse and 
reciprocal operations which, in combination with operations of “ correlativity ’ and 
‘identity’, make up the so-called INRC group. Of course, the adolescent cannot 
formulate these operations; they are only manifest in his behaviour. 

But why is it necessary to postulate the INRC group? Any such group of operations 
presupposes that the adolescent already possesses the basic connectives of the propositional 
calculus, and their expression in disjunctive normal form. And itis not altogether clear 
what further psychological function is served by the group operations upon these expressions 
— this is especially so for the operation of ‘ correlativity’. Piaget appears to introduce 
certain operations merely because he is committed to group structures. Itis also doubtful 
whether adolescents have the logical ability attributed to them by this model. Piaget 
writes (p. 181) that, ‘ faced with a complex causal situation the subject will ask . . . whether 
fact x implies fact y (which he himself will often express by two propositions which we shall 
call p and q and which he will link by the words “if (p) then (9g) "). ‘To verify it, he will 
look in this case to see whether or not there is a counter-example x and non-y, therefore 
p.9.’ Wason has found, however, that intelligent undergraduates presented with g fail to 
realize that it is important to test whether p may also be the case. Moreover, although a 
complex causal situation may be described by ‘if p then g ’, it is unlikely that this expression 
is meant as an instance of material implication. ‘The reviewer has found, conversely, that 
if undergraduates have to describe the truth-table of material implication in their own 
words, they do not say ‘if p then gq’ but rather ‘ there’s never a p unless there’s a q’, or 
‘ there isn’t a p and a not-g together ’. 

Beth argued that the concept of acquired self-evidence should replace the Aristotelian 
notion of intuitive self-evidence. Piaget further suggests that mathematical concepts 
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As a teaching text it is neither better nor worse than others available. It is a point of 
some moment that those authors who would claim to simplify the symbols and reduce the 
algebra may in consequence produce a book which is neither satisfying to the mathematically 
competent, nor yet sufficiently attractive to those who are less competent. Particularly 
doubtful in this respect is the deliberate neglect of the distinction between population 
parameters and sample Statistics, which is one of the links between the old and the new test 
theory. Perhaps the time has passd when a student of psychological tests can be allowed to 
skim over the algebraic expressions. Yet, though Magnusson recognizes that mathematical 
assumptions and deviations have important behavioural and operational implications, he 
chooses to write in another style. PHILIP LEVY 


Elementary Statistical Theory for Behaviour Scientists. By J. W. CoTTON. Reading, 
Mass. and London: Addison-Wesley, 1967. Pp. x+86. 15s. 
‘This book, which sets out to supplement an ordinary introductory statistical course, 


can be divided into a section on distribution functions and densities, one on inference and 


the estimation of parameters, and one on various practical applications of statistics. Of 
these the first is un 


doubtedly the best and the student who perseveres with this section will 
have a more mature notion of what a continuous density is tha 


n the common one which 
regards it as a kind of blurred histogram. 

The section on inference skims through the usual material including random sampling 
and how to estimate 0° from $2; but no explanations are given. The author seems to be 
hurrying to get to the section on the desirable properties of estimators, though the explana- 
tions in this section are not very good. The minimum expected squared error is said to be a 
desirable property, for example, simply because many statisticians think that cost goes up as 
ror, and several properties, such as maximum likelihood, are merely 


the square of the er! 
named. Bayesian estimation receives attention and Bayes’ theorem is indeed derived, but 


no hint is given as to how posterior distributions are actually obtained. 

The final section deals mainly with testing the difference between two means. Apart 
from the basic formula for the standard error of the mean, nothing is derived and results, 
such as the variance of a sum or difference, are simply stated. There is no attempt to 
justify the method of estimating 0? used in the t test, and the formulae are just stated. No 
reasons, either, are given for why the t distribution has to be used when 0° is estimated, nor 

described. Likelihood ratios of both classical and Bayesian kinds are 


is its distribution 

mentioned but nothing is said about their distributions, so that the student would not know 
how to use these ratios in order to evaluate his confidence in a decision. ‘The Bayesian use 
of credible intervals for a parameter is compared with that of confidence intervals, but while 
this section will make the student think again about the usefulness of hypothesis testing, 
it is only two pages long. In view of all this, the book is more profitably viewed, not as an 
introduction to theory, but as a eginner’s dictionary of statistical terms. 

J. D. VALENTINE 


ntals of Behavioural Statistics. By R. RuNYON and A. HapeR. Reading, 
Mass. and London: Addison-Wesley, 1967. Pp. x+304. 40s. 

It is claimed that this text is at an optimal level of difficulty for one semester intro- 
ductory courses, that results are derived when this can be done simply, and that the authors 
have not been afraid to introduce new material. In fact, the computational formula for the 
standard deviation is almost the only derivation given, and Sandler’s A-statistic (equivalent 
to thet test) for related means the only innovation. 


Fundame. 
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COMPUTER PROGRAM ABSTRACTS 


Project Essay Grade: A FORTRAN Program for Statistical Analysis of Prose. By 
E. B. PAGE, G. A. FISHER and MARY ANN FISHER, Bureau of Educational Research, 
University of Connecticut, Storrs, Connecticut, 06268, U.S.A. 
Description ‘This program (PEG-I) was written to collect information about ordinary 
English text that has been key-punched on to cards by typists. The original purpose of 
the analysis was to simulate human judgements of essay quality. ‘The computer Jjudge- 
ments were generated from the output of PEG-I by means of a standard multiple-regression 
program. For essays of American secondary students, such computer ratings were 
indistinguishable from those of “ expert ’ teachers of composition (Page, 1966; Daigon, 1966; 
Page & Paulus, 1968). In the simulation, PEG-I provides 30 items of information about each 
student essay, to serve as measures and predictors. These include such variables as 
length (of essay, average sentence, average word), proportion (of connectives, prepositions, 
common Words, mis-spellings), and various other counts and measures. It may therefore 
be used for investigations of prose. However, PEG-I is not limited to the present 
dimensions. It is a fairly general algorithm and may be readily adapted for other purposes 


of statistical content analysis. 


A subroutine (Marcotte, 1966) permits the detection in the text of any specified phrases. 


PEG-I was written in FORTRAN IV for use with an IBM 


Language and Configuration 
p to 12 letters long, 


7040 having 32K core storage. It uses and recognizes English words u 
ignoring any remaining letters. These are organized into double-precision arrays having 
two six-character columns, which are also addressed as complex numbers. Most of the 
FORTRAN routines are mnemonic and modular. Short MAP routines are used for 


packing single letters into the double-precision registers, and for unpacking them. 


Input, Options, Output Input text may be on either cards or magnetic tape. Capital 
letters are not recorded, but simple conventions (such as ., for semi-colon) permit recogni- 
tion of punctuation marks and paragraphs. Most key-punched English text could be 
easily adapted to the format. 
Output is provided in two forms: (1) Each sentence is pr’ 
with the obtained frequencies in an accompanying integer a 
and proportions are printed at the end of each essay output. 
are punched out together with the essay’s identification number. 
then be used directly with any standard statistical programs. 


Computation Times An essay of about 400 words requires approximately 40 sec. for analysis 
on the 7040, including time for organizing the strings into word arrays, and performing 
the various list lookup and the arithmetic tasks (but excluding phrase lookup). 


ource deck, including PEG-I dictionaries, may be obtained 
A detailed explanation of 


inted out in word-array form, 
rray. Summary frequencies 
(2) Only these summary data 
‘The output cards may 


Availability The listing and s 
by writing to the senior author at the University of Connecticut. 
its use is given by Page & Paulus (1968). 
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SLOW LEARNING 
By M. FRANK NORMAN 


University of Pennsylvania 


Learning will be slow if the quantity that represents the state of learning 
changes by small steps or with small probability on each experimental trial. 
Approximations to the distribution of this quantity for both types of slow learning 
are obtained. These approximations are applicable to a wide variety of models. 


1. INTRODUCTION 


There are at least two ways that learning can be slow. An experimental 
trial may produce small changes in a subject's habits, or it may produce no change 
at all with large probability. In the former case we say that learning occurs 
by small steps; in the latter we say it occurs with small probability. For example, 
consider a two-choice experiment in which a subject's probability of making 
response A after 1 trials (i.e. on the (n+ l)st trial) is determined by a real quantity 
Xn, A’s ‘ habit strength’, 120. On trial n+1 any one of k events can occur. 
The jth of these has probability ¢(Xn) and, if it occurs, then X= Xn= AXn 
= Uj(Xn). Learning by small steps occurs if the increments have the form 
U(x) =09Vj() and 0 is positive but small. Learning occurs with small proba- 
bility if one of the events, say the first, has no effect on habit strength (U(x) =0), 
and the event probabilities have the form f(x) = cy) (22), and (x)= 
1c) yx), where c is positive but small. 


2 
আতন consider the behaviour of the Markov process {Xn} as parameters 
like 0 and c become small. To see what we should expect, consider the above 
example of learning by small steps under the added assumption that none of the 
Vy or ¢ depend on x and var(Xo)=0. Then 


Xn - EEX] = "E (AXn— BAX), 


m=0 
so that, letting Ym=AXm/b, ; 
Zn =(Xn— EIXnD lO =(0n)ju® > (Ym— EL Yn), 

m=0 
variables Ym are independent and identically distributed 
(P(Ym= Vi)=%1). Hence the central limit theorem implies that as n—>0%0 and 
0 —0 in such a way that nd >t > 0, the distribution of Zn converges to the normal 
distribution with mean 0 and variance to®, where o2=var( Ym). Similarly, if R=2, 
U(x) =1, y2(*) = 1 and P(X,=0)=1 in the example of learning with small proba- 
bility, then the AXm are independent with P(AXm=1)=c and P(AAXm=0)=1-—c. 


Thus the Poisson convergence theorem implies that as n—~o00 and c>0 in such a 
8.2. 
K 


Now the random 
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UE Poisson 
way that nc—>t, the distribution of => AX; converges to the 
distribution with mean t. Gio ci SR 

In this paper we will obtain extensions of the central limi De bys 
theorems sufficiently general to cover most of the instances of len oe Ae 
steps and with small probability that arise in learning models. he ae Ce 
a point of considering multidimensional as well as unidimensiona ud a) Glee 
Processes, which arise, for instance, in models for choices among sev Peony and 
tives or in multi-process models, are already important in learning nptotic 
seem destined to become even more SO. In all cases we will find the ডে ingle 
distribution P, of Xp (suitably normalized) as n—>o0, 7(=0 or c)=0 Et 
In applications Pr can then be considered an approximation to the dis 
Fur of Xn when th 


. from 
€ learning rate parameter r is small. Indeed, it follows 
our results that, for any T>0, 


max d(Fr, Prr)>0 
n:nr<T 


he 
থি ডি 3 een t 
1S a suitable measure of the distance Pehle nr 
The restriction nr <T' on the natural time Va 


2. SMALL STEPS 
2.1. Assumptions 
ভ ction are abs umbers 
by small steps Biven above. Let J be 2 bounded set of positive real n rocess 
having 0 as a limit point. For every 06J, let {Xe}. be a Markov F 

with Stationary to 


real interval IL. We Ssume (a) that the con 
Biven X= does not depend on 0. Let 


MS) = EY X=), *()= EY) Xt =), 
(0) =2() — ox) and (x)= E[|Y 0X0 =]. 


at w(x) has t 
(c) that 7(x) is bounded. Udita Mins 


earning 


ti d g(t, *) such 
- 6 Assumptions we will d i i t, x) an i! 
that, if the initial state of {Xe} i: yp (Penne Sanctions ) 
normally distributed with 
nbd —>t (Theorem 2.3). 


8 0, a 
Variance 0g(t, X) as 1-700, 
(Corollary 2.2.2) the ‘m 
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‘The picture we derive of a fundamental theoretical variable AXn® with a 


tightly clustered nearly normal distribution is reminiscent of the situation postu- 
lated by Hull (1943, p. 319). 


2.2. Two Lemmas 
Since |w(x)| <21%x) <r13(x), (c) implies that w, 2 and s are bounded. Let 
2=sup s(x), b=suplw(x)l, c= suplw"(x)|, 
zel zel el 
d=sup w(x) and g=suphw'(x)l. 
zelI eI 
When is the initial state of the process {Xn}, we sometimes denote the 


corresponding probabilities, expectations and variances by Pz, 


e h Ez and varz, 
though, more often, especially in proofs, our notation will reflect neither x nor 0. 
Let 


Mn =Hanl0, x)= Ei[Xn°] 
and 


Wn = wn(6, x) = varz( Xn). 


LEMMA 2.2.1. For an xeIl,n>0 and 0 ন 
0(24+08)> — 1, y ?’ J sufficiently small that 


wn < 0a A(n, 0, 2d + 0g), 


where 
An, 0, h)=0"S (1+ 01). 
3=0 
Proof. On 1 =Var( Xn + AXA) 
ন = wn + Var(A Xp) + 2cov( Xn, AXn). (2.2.1) 
cov( Xn, AXn) = E[E[(Xp —un)(A Xn — Aun)|Xn]] 
=X, Ln) E[AX, — Ana |Xn]], 
and 
E[AX, — Aun |Xn] = Hw Xp) — E[w(X,)]) 
=0 
So that ((o( Xp) — wo( un) + (wun) — E [w(X2)])), 


cov( Xn, AXn) = 0E[(X —hn)(w( Xn) — wo(2))] 
=0E Lox —hn)* (w( Xp) নং W(un)) 
Xn Hn 
<OdE[(X, — py)t] = 0d, (2.2.2) 
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by the mean value theorem. Also, 


var(A Xn) < E[(AX, — Gu(pe))2] 
=0E[E[(Y, — w(un))2 |X], 


and l 

EY, — w(tn))°|X] = (X) + (w( Xn) — w(un))2, 

<a HEA —Ln)*, 
so (2.2.3) 
var(AXy) < 02a + 0g2wy,. 
Combining eqns. (2.2.1) to (2.2.3) we obtain a 4 
on 1 <(1+06(24+ 0g))wn + 02a. hg 
a i d notin 
The conclusion of the lemma is obtained by iterating eqn. (2.2.4) an 
that w,=0. Q.E.D. 
If hc<0, 
An, 0, Ah) < 1/nj, 


Whereas, for any h, 


An, 6, hn) <A(n, 6, Al) 

<[(1 + 0|hjn — 1/n 
S[exp(n0fn]) — 11/1. 
Thus we have the following corollary, 


oo. (ii) If 
CoroLLARy 2.2.2. (i) wn = O(6) uniformly in 19 < ZL for any L < oo. 
» Wn = O(0) uniformly in 2 > 0. 


By (i) we me n that there is agd,>0 
for any L there is a B 


<! 


* h that 
(in this case Sup J is suitable) such 


0<L- 
wn <B,0 whenever 9<0, and 
a0,>0 


that 
e.g. |d|/g2) and a B < oo such 
nd n>0 (eg 8 
5 ction 
LEMMA একে For any yer there is a unique differentiable fun 
=] that 
F(0)=f(t, x) such tha | ul 
FO) =f [0)) 
and a 
2.2. 
(0) =x. ( 
(i) un —f(n0) = O(0) uniform! 


yin n0<L for any L. 


(ii) Tf d <0 then wn — f(n6) = (0) uniformly ip 7, > 0, 
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Proof. hn = ELXn + AX] =Hn + E[AXG] 
=pen + OE[o(Xa)] 
=pn + O(n) + OE[eo( xn) — w(pen)]. (2.2.7) 


By Taylor’s theorem 
UW Xn) —Wlhn) = (hn) Xn — hn) + enc( Xn —ten)*/2, 
Where here and in what follows e, denotes a quantity such that |e,|<1. Thus 
E[u(X,) — w( by )] = cenow, [2. 
Substituting this into eqn. (2.2.7) we obtain 
ng =Hn + OW ty) + Oe, /2. (2.2.8) 


Now if uw, is to have an approximation of the form f(n6), then f(0)=u,=x 


and, by Corollary 2.2.2 and eqn. (2.2.8), (n+ 10) = f(n6) + 0w(f(n6)), or 
F'(t)=w(f(1)). Assume for the time being that I is the entire real line. ‘Then 


standard theorems on ordinary differential equations (Birkhoff & Rota, 1962, 
Theorem 1, p. 103; Theorem 6, PP. 112-113) ensure that there is one and 


only one solution to eqns. (2.2.5) and (2.2.6). We will now prove that 
hn —f(nd) = O00). 


Let v, =v, (0, x) =f(n0, x) and n60=t. Then 


[) 
Vt — Vn — Bol) = || (f+) far, 
and 
If'e+) fl <elfe +) 00) 


=8lf'( + 7*)|r <gbr, 
Where 0 <r* <r. Thus 


bys —vn — wl, )| <gb02/2, 


so that 
V1 =v, a wl, ) Ey engb0°]2. (2.2.9) 
Let A, =v, —hn. Subtracting eqn. (2.2.8) from eqn. (2.2.9) yields 
Ap 1=An + (wn) — W(hn)) + Ory, (2.2.10) 
where 
lr, |] <(gb + coo, 0) /2. 
Thus 


bea E 406°) — (pn) 
Hn — hy, 


| Ay, + 0%, 
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If 6g <1, the coefficient of An is positive, so that 


f 
Bole (10th) a anal 
টু = 
<(1 + 0d)A nl + Grn). 
Tterating this equation, and noting that A,=0, we obtain 
n~-1 
Anl<ce >, (1 + 6d, 
ny (2241) 
<5 sea, ays es (1+ dys, 516). | 
my rollar 
The conclusions of the lemma then follow from eqn. (2.2.11) and Co 
22:2. 
It remai. 


ire 
7 0, 00). Extend w to the it 
Premum and in such a way that the লা e 
SSI —y| is satisfied for all real x and y. For (0) for 
if B=inf I> 0 (C=sup I< 00) we can take W(x) =u(B) (w(x) = ul tended 
x<B (%> C). Eqns. (2 -6) have a unique solution for the ex lace 
Work in the preceding Paragraph is valid if dis that 
by g. Corollary 2.2.26) and eqn. (22.11) (with d replaced by 8) AP nique 
I for all t>0. ‘Thus f is the kl the 
the unextended function w, an 
Pplicable as it stands. Gm. 


solution of eqns. (2.2.5) 
Work of the Preceding 


Te 28. The Main Theorem On Small Steps 


(t= [* (2.3.1) 
£5 )= | Uu, %)) exp [2 | Uo, x))dv | du. Bh 
06? 
ti Bony 2. + For Any wey, if is the initial state of {Xn®} for all nd 
en t e distribution of Zn=(X o_O gun es to the no 
distribution Pig with mean 0 and wo; (6, 4))/0V2 converg' 


0. 
k P12 
Variance g(t, 2) as n—>00, 00 and nd 


‘tude com" 
y :. OMIt most details. of magnitu 
BT ONE omitted Te simi]. to those in the» Raine oly istic 
‘st Wwe obtain a difference gig, ti 1 ng for the character 
function hn()) of 1. Observe that Mal equation for 
2) = 

ny 1, Heng, yy ™ Blexp(a) Zz, )exp(AA Z,)] (2.3.2) 

For real y ll OLE A) a 


2+ 0), 


thus 
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IAAZn)\Xn]=1 + DAE[AZn\Xn]— MEA Zn) Xn)I2 k 
Bean aun Ui + WPO(ENAZne\XD- (2.23.3) 


Using Corollary 2.2.2(3) and Lemma 2.2.3(3) repeatedly, we obtain 


E[SZn\Xn] =0w (rn) Zn + O(6%2(1 + Zn) La 


E(UBSZA Xn) পথ Bsn) Ed O61 হট Za), La নটী 


Where a= fln8). Flere and subseouenthy 2 Os are uniform WHEN NB 15 bounded. 
Also 


EUSZnP\Xn)=02E(\Y»— ELYn)R\Xn) 
< O(EVS[| Yu |X] + EL Yh 
by Minkowski’s inequality. But E[Y»]=£ [w(Xn)] is in absolute value no 
Sreater than supz|w(x)|l, and E[|Yn|s|X,] =7(X») is bounded by assumption (c). 
(Here we use the full force of (c) for the first time.) Thus 
E[|AZ,|s| Xn] = 0(6%2). (2.3.6) 


Substituting eqns. (2.3.4) to (2.3.6) into eqn. (2.3.3), substituting the resulting 
expression for the conditional characteristic function of Zn given Xn into 
eqn. (2.3.2), and using Corollary 2.2.2(i), we obtain the desired equation 


ha) —hn(2) = Ow (rn) Man’ () — (n)(A°/2)hn(2)] + O(692). (2.3.7) 


‘The error term is of the order of magnitude of 03/2 
16 and 2 are bounded. 

‘The difference—differential equation (2.3.7) suggests the partial differential 
equation 


oH 
aX 


uniformly over n and A when 


aH GA) A 
3 0D -w((ON GS (0D -(O) 2 HG, DN. (2.3.8) 


We will show that the solu 


tion of this equation for which H(0, 2) = hy()) 
the characteristic function 


=1 is 


H(t, N)=exp[ — (2)2/2] (2.3.9) 
of the normal distribution with mean 0 and varian 
as 820, n> and n6 >t. ‘The key to both Objectiv' 


BOY) = (10) BO) 
for which B(0)=1. That is, 


Bl) =exp [ = fF (ae | ট 


(2.3.10) 
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Ean. (2.3.8) implies that fl 
(6 €B0))= — sr) EEO) H(t, £B()), 23.41) 
t 


2 
so that, since H(0, £)=1, 


He EB) =o | -5 f' (EB) Can | 
for all t and €. Choosing €=)B-! 


general discussion of this technique 
For any real €, let &n= 
(2.3.11) suggests that 


i 3.12) 
Hi, —Hn= — 0sn)én2H (2 + (02), (2 


and this is easily established via Taylor’s theorem. 
Also, writing 
hy ~hy চে ME) —h i (En)) শ্ঁ (h(En) ও” hy(Eén)), 


the 
and applying eqn. (2.3.10) to the first and eqn. (2.3.7) to the second term on 
right, we obtain the analogous equation 


For a 
(t) we finally obtain eqn. (2.3.9). LE 
» See Courant & Hilbert, 1962, PP: En, 
£B(n0), Hn= H(n6, &,), and hy = h(n). 


.13) 
hii —hn= — Os(vn)Enthn/2 + 0(6%2) (2.3 


Subtracting eqn. (2.3.13) from eqn. (2.3.12) we obtain 
= == = s(n) En2 2H, =hn) + O(98%2), 0 
> 
which implies that Hy-—hy =O(0U2). But Hi, H(t, £B(t)) and hp —n(EB(2))° % 
as n0 >t, hence (EB) HG, €B(t)) as n 700, 00 and n6 >t, for all £ Fete 
A change of variables £=AB(t) and an application of the continuity theo 
complete the proof. Q.E 


for hy. 


D. 


Another method of soly 


eqn. (2.3.1). Assuming that 
£, and substit 


ing the differential 


eqn. (2.3.9) is valid 
uting into eqn. (2.3.8) 


4 
£0) =2%(f())g() + (f(D), (22.1%) 
to which eqn. 2.30) si 


yl °g b h 
The partial differential ৰ ঢা ic 235 ul 
ay BB 
gt (09) =15(f00) 5 Gy)-w( 0) (4, )) 


KR 1). 
Sity with mean 0 and variance £( ) 
can be obtained from the equation 


OV 
tation by parts ‘This is the Kolmogor' 
forward or Fokker—Planck equation (Rosenblacy 1962 P. 137) for a continuous 
time Markov process Z(1) that may ber 


(Xd —pn)/0V2 (t corre 
comparable approxim 
will be considered in 


An approximation to the process 
sponds to n6) Small. The nature of this 1G 
ations for the Processes discussed in Sections 3 and 6 
a subsequent Paper. 


When 0 is 
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3. GENERALIZATIONS 
3.1. Generalizations of Theorem 2.3 


Asis shown in Section 4, Theorem 2.3 covers the linear model for two-choice 
experiments and the beta model for such experiments when it is formulated in a 
certain way. Various slight modifications of the hypotheses of Theorem 2.3 
yield a new theorem with a much broader range of applicability. ‘The modifica- 
tions encompass (i) multidimensional Xn, (ii) weak dependence on 8 of the 
distribution of AXn/0 given Xun, and (iii) state spaces I, of {Xn®} that vary 
slightly with 0. Both (ii) and (iii) arise, for instance, in M element fixed sample 
size stimulus-sampling models for two-choice experiments, where 6=1/M and 
Xn is the proportion of stimulus elements conditioned to one of the response 
alternatives after 1 trials. 

Let J be a bounded set of positive real numbers of which 0 is a limit point. 
Let N be a positive integer. Ry is the set of N-tuples of real numbers, which 
Wwe regard as column vectors. For every 0€J, let {Xn®},>0 bea Markov process 
with stationary transition probabilities in a subset I, of Ry. Let I be the smallest 
closed convex set including all IL, (0eJ). Let 


Yn®=AXp°/0, w(x, 0) =E[Yne|Xn® =x], 
(a, 0) = ECY no — ols, O)( Yu — (as, 0) [Xe =a] 
and 
(a, 0) = E[ Yn Xn =x), 


bret indicates transposition, and |y|?=y?y for YERy. Thus w(x, 6) is the 
con টা mean vector and s(x, 0) the conditional covariance matrix of the 
normalized increment Yn, given Xn®=wx. For any Nx N matrix A, let 


N 
l4R= Y ag 
tj=1 


i We assume that I, approximates I as 90, in the sense that, (al) for any 
x el, | 


lim inf |e —y|=0. 


820 yelg 
Next we suppose that there are functions w(x) and s(x) on I th: i 
| it 
w(x, 0) and s(x, 0) when 8 is small, by which we oi ডি (a2) EEA 
SUD lu(w, 8) —w(x)|= O(0) and sup s(x, 0) = s(x) |=>0 
zely ED 


as 00. ‘The function w is assumed differentiable in the s 


is an Nx N matrix valued function w(x) on I such that ense that (bl) there 


lim 20-0) w(x) - | 
Ig ly-| 


0, 
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for all xeI. We assume that w'(x) is bounded: (b2) 
sup |w'(x)| < 0, 
4 


and that w(x) and s(x) satisfy Lipschitz conditions (b3) 


wt s(x) — s(5)| 

Fp lo'(x) — (5) | <0 and sup 0) <০ 
2,yel x—yl| 2,yel —y 

2y id 


Finally, we suppose that (x, 6) is bounded: (c) 


Sup 7(x, 6) < oo. 
96, zely 


_nnld, |]. 
For 0€J and xe/,, let hn(6, x)= Ex[X 0], and wn(0, x) = Ezx[ | Xn? —un(0, x) 
Under the above conditions we have the following theorem. 


any 
Tisorey 3.1. (i) wu(0, 2) = O09) uniformly in wel, and nd <L, for 
L<o. 
(ii) For any xel, the differential equation 
FO) =uw(f0)) 
=, x) for which f(0) =». (0) el for all t>0. 


Mn, x) —f(n6, ») = O(6) 
uniformly in el, and n0 <LI. 


has a unique solution 0) 


(iii) For Any x eI the matrix differential equation 


BO) = -w(f(, ))"B(t) ix, BO) 
has a unique solution B(t)= B(, x) for which B(0) is the identity matrix. cli. 
1S non-singular for all t>0. For every 66J let {Xn°} have initial state es 
If xe], and > as 0-0, then the distribution of (Xn? —nn(6, x,)) |? converg 
to the normal distribution With mean 0 and Covariance matrix 


86») = | ৰ [Bu »)B 


“(tf SBC, x) BG, m)jdu 
as #—>00, 050 and n6 >t. 
In view of (ii), we Can, of course, Substitute (nd, x) for un(6, x6) in re 
Tt can be shown that f(t, x) 0, )= O(a) uniformly in t<L for a 
L < 0, so f(n0, x) can also be S0 substituted if [x9 — | = 0( 012). her 
A proof of Theorem 3.1 can be obtained by straightforward but rat 
tedious modification of that of Theorem 2.3. 


St 
f the more Seneral theorem are due almo 


Pendence of {X,e} on 0, rather than to 
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3.2. Generalization of Corollary 2.2.2(ii) and Lemma 2.2.3(ii) 

We begin by noting some implications of negativity of d=supzey w(x), 
within the framework of the assumptions of Section 2.1. The function “(x) is 
bounded, so I=[a, b] must be bounded, and w has at most one zero in this 
interval. If w(a) <0 then 

ul0, a) =a + 0w(a) <a, 
an impossibility. Thus w(a)>0. Similarly w(b)<0. Thus w has a unique 
Zero p, and w'(p) <0. These properties of w and boundedness of I are sufficient 
to obtain wn(6, x)= O(6) and pn(6, x) —f(n6, x) = O(6) uniformly in 1, even under 
more general assumptions about the dependence of {X»°} on 0. 


We now state our assumptions precisely. J, {Xn}, Io, I, Yn® and w(x, 6) 
are as in the second paragraph of Section 3.1 with N=1. Let 


(wx, 0) = E[((Y n°) Xn = x]. 
We assume that (al) and the first condition of (a2) hold, that w(x) has a bounded 
derivative that satisfies a Lipschitz condition, and that 2(%x, 8) is bounded. 


Finally, we assume that I is bounded, that w(x) has a unique zero P, and that 
w'(P) <0. Under these conditions we have this result. 


TnuroreEm 3.2. (i) var Xn) = O(6) uniformly in wel, and n> 0. (ii) For 
any wel, there is a unique differentiable function F()=f(t,) such that 
F'0)=u(f()) and f(0)=x. Eax[Xn]—fn0, )=O(0) uniformly over x el, 
and n> 0. 

‘This theorem ca 
of Corollary 2.2.2(ii) 

‘Theorem 3.2(' 


1 be proved by suitabl 
and Lemma 2.2.3(ii). 


i) yields immediately 


y refining the arguments in the proof 
We omit the details. 


lim sup varz( Xn) = O(0). 

By Lemma 5.1 below, f(t, x) “Pp as t->0, so that (ii) implies 
lim sup IExLXn®] ~pl= O(0). 

The problem of the asymptotic distribution of (Xn — p)/0V? as 9 >0 after n-—>o%0 
under hypotheses like those of Theorem 3.2, has been considered by Norman & 
Graham (1968) and Norman (1968). As one might Suppose on the basis of 
Theorem 3.1 and Lemma 5.4 below, the limiting distribution is normal with 
mean 0 and variance s(p)/2|w'(p)|. 


4. EXAMPLES 
Consider a two-choice (4, or As) learnin i i i 
\ £E experiment i 
response can be reinforced on any trial, regardless [অ পলকাড - ক 5 
makes. Let J be the Probability that Aj is reinforced if Ai is mad oe’ 
bn denote a subject's A, response Probability after n trials (n=0,1 5 ih 
51S 


2 M. Frank Norman 
15. 


; Ai 
‘Ey. HEE f response js 
he linear model (see Sternberg, 1963) ein 3 iF Ad, 
কা is followed by reinforcement of Ay, while Apn 
on tri 


+ d aft A In th. b 5 b. 1963 =Y 1 + 0n), 
TCE er i n the beta model (see tern CTE, ) Dn nl( 
reinfo. 


ঠ <1). At the 
=Biyon if A; is made and Aj is reinforced (Biu>21, Bi2<1) 
LU the corresponding transformation is 


1 s 
In the M element Ht 
Dp i 
sample size v (see Atkinson & Estes, TL us 
Proportion of stimulus elem i 1 after ntrials. Ift 


me 
ll elements in the EE oR 
Sponse (those already so TE EI 
Apn=(—-k Mif 4,is reinforced, while Ap, = —R/Mi 2 ith 

change), so Ap, = (v )/D 1 ) Conditioning is effective wi 


ls. 
€ applicable to all of these লালা 
In the linear model we i Parameter 0 by letting 0,7 = 04 for co 
dy>0. Thus the vector 


0, 6s, 05), 622) 


dimensional 
(0, 1/max dy), it i 


ing 
Space as 0 approaches 0. Taki 
It is easy to 


লী he 

Verify that X0= pn i ক্ৰ ‘He 
pp: ST 28 in all of the Subsequent examp 0 its 
ment AX, 0/9 is bounded, $0 the boundedness condition 
oment is trivially satisfied. Clearly 

“U(B)=dn, (1 7D tdymy(1 py “diana? _ dsr2(1 — p)p. ্ 

n 
If we take fry = exp(b,,6) (bu >0, bis <0, 0.20 <5), then In By = 0b, a 

the process X= yy i 


23 
; model satisfies the hypotheses of ‘Theorem ৰং 
Tale 7 is applicable to bn. For this proce 


1°=Dp in the er” 
| io = : > i 0, 1 M, Say 
2 "J. In this case it can be hob gj Ib {0, 1/ 


Ce 
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In the linear model, positivity of iy and dij for iJ is necessary and 
sufficient for w(0)>0 and w(1) <0. Since w is quadratic, it then has a unique 
zero p in [0, 1], and w’(p) <0, so Theorem 3.2 is applicable. The same observa- 
tion, with dij replaced by cig, is pertinent to the stimulus sampling model. The 
process pn for the beta model never satisfies the hypotheses of Theorem 3.2 
since w(p) always has at least two zeros in [0, 1]. 

As an example of a model for which Xn® is multidimensional we consider 
Wyckoff’s discrimination learning model (see Bush, 1965, pp. 172-175). This 
model is partially specified by Bush’s Table 1. ‘To complete the specification 
we replace all plus and minus signs in the table by functions of the form 0t(v), 
where is the variable being transformed, {(0)>0 in the case of a plus and 
(0) <0 in case of a minus. The functions ¢ corresponding to different plus 
and minus signs may differ. All U's must satisfy 0<v+0t(v) <1 for 0<v<l 
and 0<06<5 in order to keep v in the unit interval, and all should have two 
continuous derivatives. Then Theorem 3.1 applies with N=4 if we take 
Xn = (xn, Jn, 2n, Un)", J = (0, 5), and I=1, the closed unit cube in Ri. 


5. THE FUNCTION f 
In order to facilitate use of the approximation f(n 
here a number of observations concerning the function 
the one-dimensional case. 


0) to E[Xn®] we collect 
f of Theorem 3.1(ii) in 


LEMMA 5.1. Let x el, and f(t)=f(t, x). 
(i) IF w(x) =0, (2) =~ for all 1> 0. 


(ii) If w(x)>0, f'(1)>0 for all t> 0. If w has no zeros above %, 
lim,» (2) = ©. If y is the smallest zero of w above x, lim. w f(2) =. 


(iii) I w(x) <0, f(t) <0 for all t<0. If w has no zeros below %, 
limo (= —_o0. If y is the largest zero of w below x, lim, (2) =: 


(iv) If w(x) #0, f= F-1 where হা 
F(2)= | ন du. 


‘For an real 2 we denote the function on [0, 00) that is identically 
SE i z*, br w(x) =0, then x* solves x*'(1) =u(a*(0)) and x*(0) =x. 
Since f(t) is the only function with these properties, f0)=**(0)=* for all > 0. 

Suppose now that f'(0) =w(x) >0. If f'()=0 for some t, let >0 be the 
smallest such t. Then w(f(7))=0, 50 f(e)* solves eqn. (2.2.5). Since this 
solution agrees with f(t) at t=, f(2)=f(7) for all t>0 (Birkhoff & Rota, 1962, 
Theorem 5, p. 20). In particular, x=/(7). However, f'(2) >0 for 0 <t <r, 50 
it is clear that x <f(7)- This contradiction shows that f(D) >0 for all 1>0. 


(5.1) 
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It follows that uw(f(1))>0 for all t> 


i inuous, 
0. Since this function is contin! 
1/w(f(2)) is also. Rewriting eqn. (2.2.5) 


in the form 
F'C)lu(f())=1, 

een 0 and t we obtain 
‘ FC) dr=t. 
ow(f(7)) 


=1(7) this becomes 
Which is (iv). 


[0) 
J Ah du=t, 
ll 
oo |for a 
Tf w has no zeros above %, then w(u) > 0 for all uv >. If f()<B<o| 
1>0, then oo > F(B)> FC) 


ve 
. 0S abo 
=t, an absurdity, Thus if w has no zer' 

2, f(t) >00 as t-> 00. 


If w has a Zero above y» 


and integrating both sides betw 


Under the change of Variables u 


then 

» let y be the smallest, If f(1) =y for oR § for 

Now uu) >0 for all x <u cy. TAY, 

ll 1>0, then co > My 5)>2 for all +> 0, which is imp 

=>. 

© case w(x) <0 is s; ilar to the Case w(x) > 0 
Ean. (5.1) reduces the i 

In the thr 


expansion of 1 


lu(u). For instance 
Say w(u) =K(u 


~Mu-y), th 


: 5S 
at arises in the linear and ee” 
‘Most all of the results fob) (Gand 
iterature apply only when pst) only 
lytical imation LOD Of B. Except for these Cases, f(n6) is t fe thiB 
HE Tages [pn Current]y available, A Special case 0 d 
apie nS Wes Dopo Bush & Noster 1955, eqn. 8.64, p. 183) an 
applied to experiments] data wi ( » eqn. 8.64, 

Their derivation u 


্‌ 3.8. 
Suggests th, he At SUccess tn their Sections 13.6 to 1 
দিছ S that the error wi 
§lve no conditions for small F Will be sya 
It is not alwa 


Lif var(p,) is small, but they 
vy Vat( Dn). 

JS possible to 

even when it is tl 


| 5 and 
0 express f in terms of elementary functions, 

he expression Need not i 

about f can be Obtained direct 


. ion 
Ci uminating. Some in 
: 1 
both sides of this equation eqn. (2.2.5), For example, differen 


UO). 
Md only ip yg 


We obtain 
F(t) = uw 


If w(x) 20, then F() 20, so f(0)=0 if 
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Lemma 5.2 gives simple asymptotic estimates for the deviation of f(t) from 
its asymptote. Part (i) can be obtained directly from eqn. (2.2.5), while (ii) is a 
corollary of Lemma 5.1(iv). The proofs are omitted. 


LEMMA 5.2. Suppose that w(x) #0, and f(t)=~yelast—>0%0. Then w(y)=0 
and w'(y) <0. 

(i) If w'(y) <0, then 

fre) ancy | de 
o-tin fy” 0) Jat 
exists if 2 approaches y from the direction of x, and 
{0 -y~(—y)explG(x) +t 

as t—>0. 


(ii) If w(u)|~ lu—yliA for 5>0 as u approaches y from the direction 
of x, then 


f0)—yl~ (SAY ™t 
as t-—>0%0. 
The final results in this section concern £(1). 
LEMMA 5.3. (i) If w(x)=0, then 
800) =s(a)(e2 1 — 1) [2w'(). 
(ii) If w(x) #0, (2) = T(f(t)), where 
= s(u) 
= 2 ———— 
T(z) = (2) Ff ou) du. 
Part (i) (read 8(0) =s(x)t if w(x) =0) is obtained from eqn. (2.3.1) and Lemma 
5.1(i) by straightforward computation. Part (ii) can be established by intro- 
ducing f as a new variable of integration in both of the integrals in eqn. (2.3.1). 
LEMMA 5.4. If f(t)~>y el as t—>00, and if w'(y) <0, then 
n= (y) 20’ ) |. 
We will not prove this here. However, we note that the form of the limit is 
easily obtained by assuming that the limit exists and that g'(t) 0, and letting 
t—>o00 in eqn. (2.3.14). 


6. SMALL PROBABILITY 
Let {X,}n>0 bea Markov process with stationary transition probabilities 


K(x, A)= P(X SAX, =) 
in an abstract space I. The higher transistion probabilities 
KY(x, A)= P(X; AIX, =x) (6.1) 
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are given recursively by 


6.2 

Kis, 4) = } Ks, d)KO(y, 4). (6.2) 
pi 

From eqn. (6.1) we see that K' 0x, 4) = 62(A), where 


1 if xe 


= if xed, 
and KO®(y, A)=K(s, 4). 


For 0<c< 1, let {2X 
transition probabilities 


¢} be any Markov process with 


3 

Kilw, 4) =cK(s, 4) + (1084). (6.3) 
In the next section we will show that the parameter c discussed in Section 1 is 
of this type. 


In order to see What we should expect of 


P(XneeA|X,e =X)= Kony, 4) 


When 1 is large and cis small, we consider an example of a Markov process with 
transition probabilities (6.3). If {W; Y>1isa Sequence of random variables oe 
PWj=1)=c and MW;=0)=1-c that are independent of each other and 0 
{Xn}, and if S(n) = bl Wj, then Xs iS such a Process. Let {N() Jt>0 be a 
3=1 
Poisson process with mean inter-event time 1 that is independent of {Xn}. 
As n>, c>0 and ct, the distribution of S(n) Converges to that of Nd). 
Hence we expect the distribution of Asn to Converge to that of X 


N(, le. 
Kony, A) Lily, 4), 


(6.4) 
Where 
Lis, 4)= y Ko, A)e-tt3;1. 
5=0 
a a BIDcEss (Xo is DE Pseudo-Poisson type (Feller, 1966, pp. 


Theorem 6.1 is 


Somewhat stron 
[9] is the least intege 


Ber than expression 
T greater than 


(6.4). For any real y, 
OF equal to y, 


THeorew 6.1. For all 1, c>(, 
Kea, 4)_ To, Dl expec] mc) ~ jb 
where {(y)=1 - ([y] — 1/2. 
It is easy to show that 1 


&(y)>43 as y—>w0 “<U) < for 50, (Y)=1 for 0 <y <1, and 
y 
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Proof. The formula 
nm 
Ke'n(x, A)= > K(x, A)bs, (6.5) 
3=0 
Where 
1 
be 0) d(1— cn 
follows easily from eqn. (6.3) by induction. Let b;=0 if j>n, and let 


p=eno(ncy fil. 
Clearly 


Ketl(w, A) — Lnelx, 4)= 2 KO, A)(bs—21) 
5=0 
< সX (b-m=D 


5:0j>25 
and 
Ln, A) — Kew, Ag, > (pi—)=£. 
j Pj 
But 0; | 


0= 2 (by-m)=D-E, 
3=0 
so D=E, and i 
IKe(w, A) — Lunes, DISD. (6.6) 
Let aj=b;j/pj. ‘Then 
D= XY pa-l) 
jraj>1 
< max (aj—1) » Ed 
5: aj>1l 3: aj> 
Sah (6.7) 
0<j<n 


A routine Gon p yg 
putation yields 
mld 
= T™jo-ne 
Where (n)j=n.(n— 1)...(n—j+ 1), so 
dj n-J 


fc Go if j>nc. It 
OF <n. Thus aj,i>aj if j<nc, aqj1=d if j=nc and ajj1<aj ¥ J>Me, 
follows that 6.8) 
max aj= dnc] \2 
Writing O<j<n 
a=(1-—1/n)...(1-(— DI —c)riene, টা 


L 


158 M. Frank Norman 


and applying the inequality @¢2>(1-—x) to all terms but the last, we obtain 
J 6.9 
a <exp[jc(1 — (J — 1)/2nc)]. ( 


ining the 
The proof is completed by taking j =[nc] in eqn. (6.9) and combining 
resulting inequality with eqns. (6.6) to (6.8). oQ.E.D. 


7. EXAMPLES 


The parameter c Can be introduced into a broad class of learning ভল পল 
as follows. Suppose that the variable An that describes the state of learni 8 
after 1 trials takes on Values in an abstract space I. Any one of k events ca 
Occur on trial n+1. 


The jth of these effects the transformation 


Rd 
Ki = FE(Xn) ( G 
in the state of learning, 


and occurs with Probability (Xn) (J=1...., R). 
k 
Naturally (x) =1 and E(x) el if $x) > 0. 
=1 


tobability dh Xn) = by*( Xn), 


Hy =F (Xn), 


Where F, is the identity Operator, with Probability 


the model When th 1 fix the vector 
(dy, ..., di) and let c>0 Let {Xe} be the sn ng ngs eS 
to the supe 


uence of states corresponding 
f the Parameter, Jet 


Probabilities for thi, k ” “ 
Ks, A)=Ki(, 4). 0 Justify this not... OV Process, let Xn=Xnl, and 
in Section 6 is applicable, we must sh, 


ৰ 
%& 
d OSI<k: Fc)e4 % ( ) 
an 


LL 
box) = (l= 2)g(x) +(1- c), 
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So that 


K, x, =c Lz 
od) স d+ [> 0-dH+( -9] 4) 


1<j<t: Fj(z)eA 
=cK(x, 4) +(1-0)8A(4). 


Thus, indee 

(6. % apply FE MN It follows that Theorem 6.1 and its corollary 
ito EET described above may be used to introduce the parameter ¢ 
seb caramel i and Wyckoff models of Section 4. Linear models involving 
The aE jt been considered by Norman (1964) and Yellott (1965). 
model discussed 0 ত already been introduced into the stimulus sampling 
to writer cd ET 4, and to complete the construction it remains only 
theory. J e parameter cis a standard feature of stimulus sampling 
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LATENCY AND RELATIVE FREQUENCY OF RESPONSE IN 
PSYCHOPHYSICAL DISCRIMINATION 


By A. R. PIKE 
University of Strathclyde 


[' ৰ 1 . . . 
T he relation between response latency and probability is discussed with 
Particular reference to differences between mean times for correct and incorrect 


responses in difficult two-choice discriminations. ‘The predictions of some 


finite Markov chain models for this situation are considered, and possible modifica- 


tions of these simple systems which may bring about features of observed data 
are discussed. Some results, in the form of curves of the latency-probability 
relation, are presented from earlier work and new discrimination experiments. 
‘These data are discussed in relation to theoretical interpretations based on a 
particular stochastic model which describes psychophysical discrimination in 
terms of the accumulation of units contributing to the overt response. 


1. INTRODUCTION 


bees a paper by Audley & Pike (1965) it was pointed out that the relation 
Soni n the probability of making a response and the time to initiate it is of 
several rable Importance for testing models of choice. ‘There are, of course, 
partic es of controlling response probability in choice situations, and the 
Varyi ular method considered in both the earlier paper and this one is that of 
eS the difficulty of a discrimination task. Difficult tasks clearly lead to 
and ne probabilities close to a half, and easy ones to probabilities close to ) 
situ Es respectively, for incorrect and correct responses. The exit 
to deci which will receive most attention here is one in which the subject has 
liner ie which of two stimuli is farthest in one direction along some be 
required A number of discriminations varying in difficulty provide t 
atencies set of different relative response frequencies and their associate 


to : . 
Stimuli, and many theorists bave found it simpler to discuss a choice model 


He . . . 
fone terms of the relative frequencies with which different alternatives are 
exce nM. The statistical decision theory of psychophysics is similar in this respect, 

, rterette et al. (1965), Sekuler 


detailed investigation wil 
and frequency may 
derlie decisions 


(196 
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2. THs LATENCY-PROBABILITY FUNCTION 
under Consideratio 


Y for a response probability of 0-5, 


a 2 i or 
empirical case, a sufficient number of P points f 
reasonably accurate interpolati 


=0:5 


Mean latency /latency for P 


for three types 0 


= 


Fig. 1 is an example of this form 
Processes to be discussed below. 


1.0 


e০০০ ০o 
ce ov 2-0, i 
° SB Se 
/ 


f 0.8 
Theoretical LPF Probability of response 1.0 
f stochastic snl lat Bsa function of response probability) 
(and a T choice 2 


ক R= 
0, for accum ae pn) e, k=2; O, 


cr 
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Iti - 
oe ant os in the two-choice situation, the probability of the dominant 
dominant Fes y the Correct response) will vary from 0-5 to 1 and the non- 
discrimination will co-vary from 0-5 to zero. Hence, for any range of 
Dararmeters, bai X Hs or, in the theoretical case, the associated range of model 
The general or Hf LP points will be generated giving the required function. 
acon (domi of the graph will also give the difference in mean times between 
probability of Es and incorrect (non-dominant) response for any particular 
refer to such a JiR correct. The term ‘ proportional time difference’ will 
Few-theor ifference derived from the proportional LPF. 
fons, of the IR in the field of choice behaviour have been explicit about the 
time difference Lb and not many more appear to have discussed the associated 
Ths dunstion etween correct and incorrect responses in a two-choice task. 
Lab log P ত considered by Luce (1960), who derived the equation 
to the data of j and then proceeded to cast doubt upon this equation by reference 
Kellogg (1931) (Fig. 2). Luce points out that the mean latencies 


1.279 


= 0-5 


Mean latency / Latency at P 


0.8 


(b) ‘Equal’ allowed 


0-2 0-4 0-6 0.8 1-0 
Probability of response 


P I ন 
roportional mean latency as a function of response pro 


f Ji 5 2 : 
of light intensity. After Kellogg (1931). O, response equal. 


FicurE 2, 
bability in a judgement 
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0.2 0-4 0.6 
Probability 


0-8 


ility: median 
Oportional mean latency as a function Of response i position. 
latencies for 24 Subjects in unbiased Sroups in a Judgement of spa 

er Laberge (1962). 


12: it 1S 
lated response ang go" Com 
ruct an LP Sraph. This has been done in Figs. 8 an 
the data of Woll and Cross & L 
elow, together with other 


NCY IN Rerarioy To ResPoNsE 
PRonapy ry 

As mentioned above, few theories of cho 
the problem of differences between Correct 
example, difficult to Obtain a ite Prediction of the 1 ed 
the model of Cartwright stinger (1943). Wichiis,otheryise well develop 
(and ‘fitted ’ to experiments S fe I will not be. discussed. 
appears that only the Stochastic models 


Statistical decision models 1 
Stone, 1960) have been Stated explicitly enough be compared with 
Obtained from Psychophysical dis, 3 


" J 
crimination, (The latter models may usualy 


ice have been 


্ ing 
explicit concerni 
and incorrect 1 


or 
atencies. It of El 
atency difference fr 


Ea 
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be interpreted as a generalized form of ‘random walk ° (cf. Feller, 1957) and 
may, therefore, be subsumed in the former.) Before dealing with the stochastic 
models, two theoretical approaches of a different kind will be briefly discussed. 


3.1. The Hull-Spence Approach 


লা The relation between response tendency and latency for the case of a single 
Hage Ure has been discussed by Hull (1943) and Spence (1954) and it appears to 
ক! cen interpreted by some as indicating an inverse relation in more compli- 
He Situations. Wollen (1963), for example, appears to assume that a weak 
EL nse EA will result in a longer latency in the two-choice case. How- 
re such an assumption is by no means borne out if an attempt is made to 
deals ort the Hull-Spence model in terms of competing responses. Spence 
ন sete with relative frequencies in this situation but a derivation of latencies 
a ন re je. He shows that in the case of a single latency the postulate that Hull 
earl ed (RT = aE, where E is effective excitatory potential) is inconsistent with 

er assumptions. He draws upon work of Estes (1950) showing that response 


latencie টি Ln 
os IS form a geometric distribution, resulting from the theoretical model 
scribing the momentary oscillations of E about the fixed reaction threshold 

bility 7 that a response 


ডা a EnEly oscillating inhibition (i.e. there is a proba L : 
dist in any discrete time interval, given by the area of this DONE 
er ion below the threshold value and hence a mean time to response 0 
asic ৰ ey dealing with competitive responses, Spence (1960) uses the 
iffere odel, with each response having the same threshold and oecilatn ut 
a lat nt mean excitatory levels. It is possible to combine these ideas to form 
ency model for the two-choice situation on a particular trial. According 


to ‘ na s 
the theory, in each moment of time from the beginning of the period when the 


টব nism is confronted with the choice-point there will exist (1) a BOE 
ihn response occurs, given by the sum of the areas of both fr Nay 
respo utions below the threshold level; (2) a probability that the TEE 
Sn Se will occur, given by the product of the dominant area Eee হে i Rs 
omi. ant area below plus the probability that if both are above et 0 
Donde ant has a greater excitatory level; and the remaining proba দ* - Ee 
e Sale response will occur. These probabilities may i রা! 
is is 8 of the theoretical parameters and the results are given Dy pence, 
ei লালা চক কলাম i 1 f the excitatory 
Potenti টা now easy to show that, for any specified constant va ues ph Mey 
Situation S— which would be the case if this model were to be adapte oo 
identical Considered here—the resulting response distributions are geometr 
fee OUD response times. This is because in eac| 
to the rsa events, described above, will occur an 
no Eo ed event, giving a mean time of 1/1-r, whe 
tesponse nse in any time interval. Thus, in the competi 
tendency has the same mean time as its dominant coun! 


di 


ng situation the weak 
terpart, although 


but they will not be Considered here. 


€ correct and incorrect latenci 
Processes is that of Takada (1960), wh 
of two lines. Hi 


of 
f ac: 
iscrimination, so that the Le The 
S upon the efficiency of that inl with the 
a I i i ia 
crucial point is that this effici 'Y from trial to tr Hence he 
’ taking a longer time to perform. ut Bods 
Predicts that Correct mean times should be longer than Incorrect ones, 


Because the 
is not Confirmed b 
[J 


ere is litt] 
©, then it is imes an' 
fo Ur nly interpretation to explain longer correct time 
Laming (1968), for ex le, uses the notion of 
1 Sampling to do so 
Process which is 


ent 
ables dior 1 “S50 there is One version of a DET el 
€ to predict Shorter times “ Correct responses in difficu 
tasks: Audley & Mercer (1968) have show, that T 
do this by Assuming th 


‘+erions 

at the closer the quality On Titerio 

ot Of information is to the c g 

then the longer the decision Process for Placement to one side or the other a 

that criterion. ence, since Means of the i ই ‘ inform 
দা A) Ae 2 “correct portion of the 

tion’ (likelihood.ratio) distributio i P 

Correct means, th 


the 
Nl usualy be Tiearer the criterion than a0. 
© sult follows, 1S is similar to the Coombs & Torger 0 
(Coombs, 1964, p. 530) Mterpretatio t urstone distribution 


1 
€ Signal detectability model w 


rms of the 
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differenc Sn, SE ; ; 

in EE ক কত y dispersions 2, Shorter mean latencies for errors 
by Rabie (1966) ৰ have been interpreted in terms of information capacity 
af perceptual ts EEE by Rabbitt & Phillips (1967) and in terms 

Here, however ত SR J Coon 

byte or Re » We are more interested in discrimination of a kind which is, 
The possibility রি more plausibly explained in terms of stochastic processes. 
toipiopose eg Ee form of indecision may occur in the system leads one - 
Sartielsy pant which that system assumes various pre-response states with 
AHOther me টি 5 and the transitions between one pre-decisional state and 
PresSDEdL arts ise escribed stochastically. It is hoped that the arguments 
SESUTanatrem EE will make this proposal more acceptable. In the next 
Process may 2 made to show that some fairly simple forms of stochastic 
graphs. or the observed time differences and form of the LP 


3.3. The Stochastic Model for Choice 


ত models for choice behaviour have been described by various 
and Pike (1966), Audley (1960), Estes (1960), Bower (1959), Laberge (1962) 
of finite Mark ). Usually, the processes proposed have been some simple form 
নও Sotedes chain (cf. Feller, 1957); and the general term ‘random walk ’ 

will be Fade een used to describe them (e.g. Snell, 1965). Herea distinction 
response late HE three main types of model, and their predictions concerning 
of the Edel ine. in the two-choice situation will be examined. The division 
Will therefor hes three main types follows Audley & Pike (1965) and little detail 
competin b e given here. It will only be stated that each process consists of 
absorbins ubsystems and proceeds from an initial zero state to a response 
g) state, that the threshold for a response is k pre-decisional events 


steps 
্‌ধ ac i B 2120: o 
» accumulations, run length) of one kind and that the probabilities associated 


with 
Egor Ae SA event are taken to be p andg for the correct and incorrect 
Such event SE where p+q=1 and p>. Discrete processes, with one 
ie n each time interval, will be examined, but the comments 
s time processes (i.e. the 


eve 
er, 1957). 


nts oc £ 
Cur according to the Poisson process—see Fell 


The 
"lis Model 
n thi s 
Pre-decision model the response threshold is de 
events in ওম events of one kind, regardless of the sequence of the different 
Ime. In the operation of this process, correct latencies are always 


short 
er th: s 
€ ratio ke incorrect latencies. This comes about for the following reason. 
tesponse at ss (conditional) probability of a correct to that of an incorrect 
iscrete time n decreases as nis increased from k to 2k — 1, the latency 


Tange 
of thi 
is process. ‘This ratio is of the form 


R,=(plg: (k<o<2k-1) 


fined as an accumulation of k 
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st a i 

for the response at n=v. Hence, the incorrect latency distribution w ee 

reater mean than the correct distribution. This latency difference Ee রী 
nh the value of k, but it appears that the LPF will remain fairly Sn oe; 
form if expressed as in Fig. 1a. An expression for the mean latency 0 
responses is given by: 
k 2k-—I1! qrpE-1 
p KE-ITR-T EB 
Probability. This is derived, by differentiation, fro 


Presented in Audley & Pike (1965), and the mean 
Onses is similarly expressed. 


uy, = 


Where P is the response 
the generating function 
latency for incorrect resp 


The Runs Model 


Here an overt Tesponse occurs when a r 


n from Fig. 1, the LP 
€s of k: for k=2 it is 


un of k events of one kind A 
graph for this process varies wie 
decidedly asymmetrical and Oe 
However, with increasing Rk t 
Proportional time differences ten 
ies for p=0.7 are as follows: 


Proportional 
difference 
k Correct (C) Incorrect (J) I-C/Mean 
I 2.75 2-96 0-075 
5 14.15 14.35 0-014 
930 930-5 0-00054 


The ASYmmetry 
bilities of Tespo 
Variation quick 


1s again due to variation in the ratio R, of conditional proba- 
ন at time k, R+1,... (an infinite range in this case), but this 
Y Decomes negligible as the threshold value is increased. 


TR By 

i - » 2S shown in Fig. 1. 2 

Constant at the valu Re this conclusion ediately, since this ratio is 

difference between the (Bla) (et. Feller, 1957). ence, flere is Ho latend 

nual go Col we Ee 

ন) শল. EE ed (i.e. equal theahald) case. This 0 
(cf. Feller, 1957). Ts are of the form prgi(0) and ggnl 


Y given pro 


e Moments Of the Jat 


ty in the uUunbias 
nt Benerating functio. 
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4. FACTORS AFFECTING THE FORM OF THE LPF 


. Modifications may be made in the three stochastic processes which lead to a 
time difference between correct and incorrect responses whether or not it already 
exists in the basic models. The main forms of these will now be discussed. 


4.1. Bias 


It is apparent that any of these systems could be modified to represent 
Phenomena corresponding with empirical bias towards one response or another. 
Such bias may be regarded as being due to sensory, perceptual or response factors. 
Bias which is judgemental in origin is also conceivable, and this would therefore 
bear some relation to variation in the decision criterion as in signal-detection 
theory. In stochastic systems of the above kind, bias may be described in terms 
of unequal values of the threshold for response (k-bias) or it may be associated 
with the values of p and q as, for example, when the left-hand member of a pair 
of stimuli always appears to be relatively greater. In the latter case, a bias 
Parameter, either multiplicative or additive, would be included in the model. 
One consequence of these modifications is to bring about a relative decrease in 
latency for the response toward which the system is biased, although it 1s not 
fenerally possible to describe these effects in any simple manner. However, in 
the ‘Case of p-bias—and for threshold bias in the random walk—it is possible to 
derive the consequences simply in the following way. 

. Consider the general expression for the difference between correct and 
Incorrect latencies when a position bias exists in the two-choice situation. This 
means that the response probabilities and latencies for correct and incorrect 
Tesponses will differ according to the stimulus arrangement and must be averaged 


t ‘ ক . 

i Obtain the overall values. Specifically, if Pi, Pa» ty and 1g are the Co 

sponse probabilities and latencies and Oy Q2 0v 02 the respective incorre 
থা t and correct 


Probabilities and latencies, then the difference between incorrect anc b 
Mean latencies (or between non-dominant and dominant latencies) is given DY 


U~—U, = Owi+ Qs _ Pit, + Pats 
lb O0,+ 02 Pi+Ps 
(02-1) +PsQ0 — ua) + PoQ 0s — U2) A) 


= PiQ(o—u) +PiQ1 ) 


(E+PNQit [d 
I In the case of D-bias, k remains constant and the sign of the latency 1 
Ay be deduced from the known form of the unbiased LPF; assuming i 
EE ie 0-5 so that the four points of the curve relevant to eqn. (0) a HE 
‘At to right, (Q,0;), (O202), ( Pts), ( Pui), with the order of the L values dep 


Ing upo 
1 the form of the function 
ly A and 
For the random walk, t=; and u2= 02: los ST চি জৰ Hes 
5 last terms in the ৰ 
,. Hence, in eqn. (1) the first and ts the third. 


2 UY, 
the remainder is positive since the second term exce 


Vanish and 
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‘Thus this form of bias in the ra 
are longer than correct latencies. 


now two cases of eqn. (1) to consider: the third term in the numerator is either 
negative or positive, since u, may b 


C ce € greater or Jess than v,; but in each case the 
latency difference is positive, 


Since term 2 will exceed ter ট a 
mM 3, and the same 
clusion follows as for the ran. 


dom walk. For other than very small values of k 
the separate Proportional time differences should be very ely Zero. 


1.2 


ms of the equation are positive 
ven in Fig. 5. F: the overall effect is slight and an 
ere is no requirement that the 
Ments be equal. [ 
© Processes considered here, al 
for the dominant response. 


FiGURE 5. 
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Co : a 3 

the his ক ন ত k-bias are arrived at easily only in the case of 
Hentioned ave Re the special properties of its generating function 
en DON টী iffering values of k are now constant with respect to 
stu acer chan ন be e values of p and are constant but alternate as the 
tOiEtinulis a Hl ence, the response probabilities will change according 
conn ith po ST The mean latencies are no longer equal but remain 
difiefente, inthe pee to position, the differing probabilities producing an overall 
Sle derives Feet and incorrect latencies. In fact, all the moments (i.e. 
fenise, of the m.g.f.s weighted by response probability) will show this 

t follows from the fact that the m.g.f.s are now: 


Es phiu(0), — gz0(6) 

one stimulus ar i 

dnt rangement (k, and dominant stimulu left) whi 

Oclated response oS Tice s on left) which has the 


and pHU0)=Py, ge u(0)=On 


f giu(0), pie o(0) 
or th 
€ other (k, and non-dominant stimulus on left) and with 
guu(0)=Q»  peul0)=Ps. 


The fi . a 
are given Stunt u, v differ only in the parameter k and the me 


an latencies 


1, =u(0)/u(0) and  us= ’(0)/u(0) 

It can be shown from 
herwise obvious, that 
=02 and u2=0y, the 


for 
the re 
$i | 
e tenes os associated with k; and ks respectively. 
When he expressed in terms of k and p, if it is not ot 
Mumerato 2 then P,>Ps and u;<ug. Hence, since ui 
T of eqn. (1) reduces to 


Which i (us—uNPi0:— P20), 
S 
Always negative. Thus, it must be concluded t 


Onger 
js correct latencies in this case. | 

S0 it ap rm of bias in the accumulator produces 2 fattening of the LPF, or 
Pears from some computed examples, but this will not be illustrated. 
ly len PF, so that incorrect latencies are Ccon- 
An now gthened. This would be particularly important for this model; 
ily with experimental data since the mean latencies 
oximately equal, 


Je of this biased process LPF is 


hat correct latencies are 


arate kh 
Arying Toor Lt categories are NOW 
Biven in Fig Ge to k, and ky. An examp 


Bias i ট 

i 1s -! . . . 

Will be ref » Of course, an important factor in signal-detecti 
erred to below 


on studies and this 
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0.4, 


0.2 0-4 0-6 0.8 1.0 
Probability 


trial 
Suppose the Values of the underlying Parameters p, g can vary from tr: 
totrial. Itis clear that when th 

>0'5)is at a hi i J En SE 
there is a greater Probability of the c Ponse occurring (ie. ) The 
Petition in the Process is more one-sided than at the Average value of 2) 0 (and 
» Nearer to 0.5, are in DEER erent E 
for g). Hence the time di 

ials. 


2 bilities 
Preciable is Shown y calculating response proba 
and latencies According to the Beneral form 
Dah, mn X atPimyo / XY api, 
Ee i 
Where a; is a 


ea frequency Value (or density i 
the distribution (heterogene; of p such that 
associated Tesponse Probabilities and 

for heterogeneity in th inom 
mean value of 0.6 and k=4 in the random 
rect latencies of 9.5 and 10. Tespectiye] 
0-92 in contrast to the equal late 
overall latency is du 


latency resulting fro. 


of 
e 
in the continuous tae 
=] and P, and mi, ও ag 
Moments, For example, one ith 2 
€ form of a binom; Of parameter b c0rd 
Process gives correct and lity of 
Y for a COrrect response probabi lowe 
CY Value of 13. in the usual case. ‘The an 
jPrently, to the i alance ir response frequency on 
mm heterogeneity and this is One aspect of a more gen 


বা 
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influence upon the shape of the LPF. To investigate this problem, it is 
Necessary to consider simplified examples of LPFs and it would seem that 
Symmetrical or near-symmetrical LPFs will be skewed to the left; this is expected 
from the results for p-bias described above, which is a special case of hetero- 
geneity with the distribution taking two values according to stimulus position. 
. An important point is that the effect of heterogeneity is to considerably 
Increase the skew and kurtosis of latency distributions. ‘This would appear due 
to the fact that the components of these heterogeneous distributions have a 
common minimum latency (i.e. threshold value) in the discrete time sense. 
Latency distributions from psychophysical discrimination tasks do not appear 
to have very high values of these statistics, in contrast to simple or choice reaction 
time distributions, and this feature should be taken into account: it will not be 
done so here, but elsewhere some empirical comparisons will be made. 
fi In contrast, heterogeneity of the threshold value, k, results in a tendency 
Or incorrect times to be shorter than correct. This arises because the longer 
responses, which are associated with the higher values of k, will have a greater 
Probability of being correct. This particular form of heterogeneity could have 
avo POrtant application to the interpretation of differences between latencies ৰ 
ins structed and speed-instructed tasks. For, if the result of a spee 
~tuction is to introduce a heterogeneous and lowered threshold value into the 
Scrimination process it becomes possible to account for the reversal of latency 
EES (dominant response having a longer latency) in some speed SHED 
Jects—e.g. see discussion of Greenberg’s results in Coombs (1964, p. 535) an. 
ollen’s (1963) fast responders discussed below. This is illustrated in Fig. 7 
tha =1 and 2 in the accumulator, where all correct responses are now longer 
1 incorrect. This result will be considered further below. 
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et Y of this 
hould be noted that the k-bias discussed earlier is not an ie tial, 
RE eity which implies an equal value of k at any par Loree 
ই ক between trials. In fact, this fluctuation ee 
a সা in attention ’ and this Possibility will also be discusse 
to vV' 


4.3. Decrease in Threshold Value 
If the threshold for a res 


trial, it is now by no means obvious 


to 
i 1S a umed 
alk Process, in which the value of k is ass 
decrease by 1 with each time i 


intervals 
rval that the process A ac oHiect 
without a response being achieved. For consider the probabilities 0 
response at times k, k++ REE 


k: pk (k steps in one direction.) 


k+l: kptg (The Process ha: 
incorrect direction wit 
with k steps in the co 


k+1 k—-2) 
h+2: ( চ ) Pg: (Two ‘backward? Steps with decrease of threshold to k 
And so on until 


S taken one step, 
h the result that t| 
Trect direction still 


in the 
in one of k possible be 1, 
he threshold has decrease 
being necessary.) 


‘sions have 
When k-1 Correct and incorrect Rie-decislons Jd 
Re + sho 
t pre-decision being Pecessarily decisive, the thre 
ced to 1. No latency will Occur which is longe 
Ml lator 
In the above Case the Probabilities resulting are those of the accumu ares 
Process with pte Pb and k €.g. see Audley & Pike, 1965) and ee 
Of course, Similar for the incorrect 58 ber 
es ip de Fesponse, Hence the proce F ee 
it is already 


€ reason fo i 
One reason T descril Proces 


1 ing thresh 

is to take into account the Possibility of Biles ee o! ft Pre time with 
accuracy. It appears Plausible that at least some subject ll be willing 9 h 8 
this and itis a Conclusion of the T studies of Snod J 1. (1967) that suc 
trading occurs and Should be further investigate q Brass et al, ( 


old 
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5. SOME EXPERIMENTAL RESULTS 


Consideri 
Jere UG the mean latencies, the results of previous work in 
with a single level LE such as that of Henmon (1911) (in an experiment 
(1963), IDLER 0 difficulty), Kellogg (1931), Laberge (1962) and Wollen 
insofar as the oO in favour of the stochastic process theory of choice behaviour 
from some related ECE: latencies are generally shorter than incorrect. Results 
tonitlerg: resnons ১৪ are similar, especially those in which the subjects have 
example, this sit along some hypothetical ‘ gradient ’ of generalization. For 
Gibson (1939) on has been studied for a series of vibratory stimuli by 
shorter and ¥ found that the average latency of wrong responses was 
latency alee Fee med this was the case, contrary to expectation, because long 
Situation, EN been inhibited. Later, Mednick (1958), in a similar 
expected negativ i ibson s prediction that if errors were not emphasized the 
€ correlation (i.e. between latency and probability) will be found. 
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rived from graphs giving 
ulus intensity for 20 
ditory discrimination 


RE 8. 
LP graph. after Gross: & Lane (1962). PIU de 


Probabili 

ba . . . 
Subj of response and latencies as functions of stint 
task. —two response modes combined ina single stimulus au 


Bus 
S (19 
দি ১% had previously found the inverse relation in a study of concept 
and generalization. Pierrel & Murray (1963) found that each of eight 
Cross & Lane 


Subject 
Sh 
(1962) un Shorter correct latencies in a weight-lifting task. - 
similar results for 20 subjects and an LP graph may be derived 
bility of latencies for low 


from the; 
Probabilicic as presented in Fig. 8. The large varia 
he EE demonstrated in these results. | 
S may Bl theory of signal detectability has not specifi } 
e incorporated within its applications to human behaviour. 


ed how response 
his 


latencic 


176 A. R. Pike 


ical 
is unfortunate, since there clearly exists a correspondence between EE  - 
discrimination and the signal-detection situation (cf. Lrelsman { Sn] sign 
‘The choice model of Laming (1962) is partially based on Ee cla: nodels in 
detection theory and it should not be difficult to construct SIC piles | 
which the theory plays an intrinsic part (i.e. with the REC udley & 
given directly by that theory). A very simple version is Proposee 2 invessell 
Mercer (1968) and is sufficient to indicate that if response latency Var ‘on, then 3 
with the ‘ distance ’ of the decision point from the detection criter sb 
continuously decreasing LPF is generated for the S 4+ N and N hE been 
tions as these vary in distance from the criterion. Some latency results which the 
obtained in the detection situation: Sekuler (1965, 1966), in a study in C4 shorter 
signal was masked by an immediately following noise stimulus, OTE of three 
latencies for correct ‘ Yes’ responses but no difference (at least in Hiained, in 
subjects) in latency for ‘ No’ responses; Carterette et al. (1965) Eo incorrect 
the equiprobable case, shorter correct latencies for Signal and shorte A subject 
latencies for Noise for two subjects, but the opposite results for a thir d 
It would thus seem that bias effect 


e 
Hee Jfenda 
S are strong in this situation. hee 
(1957) has found that latency distributions in signal detection are i Ff rmation 
in form but fit better a decision System based ‘on runs of classifie 


from discrete observations °. 

With regard to the LPF it w. 
argument is whethe 
for P=0.5. That 
Wollen (1963), 
involving disc 


at of 
ill be observed that an important Pe value 
T Or not the first half of it, for P< 0-5, falls be oT i data of 
it may do so in some Cases is clearly shown in th erimen® 
Who also made use of the effects of generalization in an XE ontally | 
rimination of spatial extent. In general, for expe 
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unbiased 

groups, latency i 

to Wollen th y increased as frequency decreased, but i 

responders. et esults could usefully be separated into tC 

ঢ় Shape beige EE roups yielded data which give LP curves Oe 

that the slo er two response groups. It will be i A 

form of the pe have average latencies which onfOET 3 

have a form leafy ত accumulator process, whereas the fast FEaOndENs 
e of a low;k (or biased k) runs process. 
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mean iE 0 subjects in length discrimination. Points are averages of 
cases for each Se intervals of 0-1 along P axis. Each point based on several 
discrimination. ject who each received many trials with differing difficulty of 
RE 11, I) 
: LP a 
averages ER for 15 subjects in a numerousness discrimination. Points give 
approximatel {15 10, and each interval contains several cases based on a total of 
Pike (1968). y trials for each subject with four conditions of difficulty. From 
Pick 
hase tt i EE 
as a eT ol in a texture discrimination task, found that response latency 
nverse relation (64 out of 88 cases) with response probability and 
He appears to be in favour of 


nts gra i 
Braphs similar to those presented here. 

on, and these would be formally 

some results 


Figs. 10 and 11 are 
rted elsewhere. Fig. 10 gives 
k involving discrimination of 
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Optio, 
"Ptional ston; 
Pping ’ models for discriminati 


imilar 
Sata ga jy sy “ochantio models. Given in 
ig 8 Average LP writer which will be fully repo 
ata extent EE five subjects in a tas 
lat riments Ay e details of which are given in Pike (1968). For this series of 
are CICS and if he of 54 pairs of correct/incorrect responses had shorter correct 
the averaged Be e results for stimuli of equal difficulty (usually right left pairs) 
dots 2B for ৰ only one out of 27 has correct latencies longer. Fig. 11 gives 
Subjects in a numerousness discrimination (randomly spaced 


flat Ww en o 
LP টী 18 of 29 pairs had shorter correct latencies, as indicated by the 
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6. Discussion 


in the intr 


ng 0 
ৰ 1S 
ue : ic distributio 
in the competitive Sense) results in geometric a symmet 
qual response times and there is little appearance 
in the LP data illustrated here. 


hat any 
The point may therefore be made th 
model which attempts to describe 


EN 
. repres' 
behaviour at choice-points সজ i or 
the underlying indecision. This model is, of Course, already stoc a S apsyste 
and modifications Would only make it resemble more the competing 
type discussed here. 
Itis clear that t 


nab’e 
i "আত a ive a reaso 

he stochastic models representing indecision give 

St approximation to latency data. 


h data 
It is also fairly obvious from ne of if 
te random walk model in its simple form cannot be accepted bec 
Symmetrical LP cury 


ropo 
ve. 8 Variation of this model has been proPO™ 


An interestin tion tT f 
by Sektler (1965), based on MeGilPs (1963) counter model for reac 0 


incorrect ¢ No’ 


ect 

model to explain the fact that ne er 
© Tesponses have equal latencies does not SDpear Se Co 
satisfactory since he has not developed this Particular approach (see ions Hf 
1966). Heis clearly aware of the Problem of bias Present in these situati of 
advocates the furt er study of it. The Study of Carterette et al. (19 1 ject 
further high-lighteq this Problem with the data of two of their three hl gr 
being clearly biased in opposite directions to the third. These workers ৰ pl 
to পকা কা 3 ke tS interms of a two-dimensional random walk a te 
aware that the one- imensional i i 
clear how the model combines i. Latency model heap ত 


ion the) 
na ith the detection © 96 
Carterette he (1965) also criticize the Btn (U0 Ed Laberge nerdy 
models in terms of frequency distribution type; their latency data are Fim Ye 
of Pearson distribution Types I ang vা (bet) or Tyne TV ন d they clay vr 
the random walk and Accumulator distributions ar: টি pe TI tna): bl 
similar criticism is made of the use of th € {ype 8 


10 
12 

: ene 

° gamma distribution in £ 
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y Rel 7 71S D. 
ative Fi equency 0 Respo 2 ych LYSLC D 


Snodgrass e 
dre oo Lee চত এ that the Laberge (accumulator) model is a 
50: adit whats a wv nma, the negative binomial’. But the latter auth 
gamma distribution i ery important point as far as any criticism based is 
relation among the is concerned: that this Pearson type corresponds a 
empirical distributi moments and there is no way of determining oN le 
Values of skew aE approaches it. This is related to the fact that th eal 
distribution and kurtosis indices, which form the critical valu. FO 
and the as a Ne generally biased as estimates of their সাক বুদ 
advocated b Ce ion is sensitive to small changes in these values. (Thea ড ah 
this Tegpect So (1965) in terms of cumulants may be found more sll 
models the nes orn the Case that for the lower values of k in the stoch: ঃ 
discrete-time ac cy distributions are only approximately gamma and i the 
these and other are well within the beta classification. In ie of 
means of the ee involved in the fitting of empirical distributions b 
discussed here i ESOn: classification (cf. Kendall & Stuart, 1958), it will not be 
that averaged oh relation to the data obtained by the author except to mention 
(see Pike, 1967) 0 and kurtosis indices generally indicate beta distributions 
tion of th Sskelical nl be suggested that comparisons with computer simula- 
models than die Lo els may, in future, yield better tests of those theoretical 
data ? vield the omparisons with their theoretical features, since simulated 
| Té would s expected statistics for given ¢ trial’ numbers. 
Suitable model i pear from the empirical LP data presented here that the most 
if only simple ial Psychophysical discrimination is the accumulator, particularly 
considered ah a (i.e. unbiased, homogeneous and fixed threshold) are 
other. The ন orm of the LP curves indicating this process rather than any 
Present becau riter does not intend to conduct an exercise in curve-fitting at the 
reat: apart fr se the scope for modifications in the theoretical processes is so 
ased ones ke those modifications discussed above, there are more empirically 
or the consid E23 the introduction of initial (fe. non-response) time intervals 
Probably EE of aberrant fast or slow responses, although the former is 
aS in reacti so important in discrimination times under accuracy instructions 
be ion time studies. 
lator imo del it clear that the result 
ast respond ome of the individual 
; ers of Wollen (1963) and 


s do not unambiguouslysupportthe accumu- 
data of Pike (1968), Laberge (1962), the 
Occur jn s the fact that shorter incorrect latencies 
lL peed-orientated subjects (cf. Coombs, 1964) indicate that the low 

he value for P= J. Laberge 


tobabil; 

t 

(8st Ue the LPF may sometimes g0 below t 

964 the explanation lies in an artifactual fast-response process, Coombs 
d Wollen merely suggests 


’ Dp. 
Ra saabnc SE a ‘ speed vs. accuracy set’ an 
Sup ained in fea slow and fast responders is artifactual. These results may be 
( bposing That & OnE of the accumulator model, as indicated previously, by 
tr terogencity i fast’ responders are using i 
ading ETc R). ‘There are therefore two possible interpretations of the 
n accuracy and time in response Jatencies: one is that subjects 
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. have 
5 bjects may ha 
“thresholds ° during a trial; the other pr ding differing 
bc JR nrg between trials, with the extent i more satisfactory 
‘ thresholds ’ vary heterogeneity) explanation is 
ন The latter (heterog 
between subjects. 


in. d 
i hreshol 
ecreasing t 
icts the speed vs. accuracy results, whereas 5) ' চাম্প শা 
os et (In the random walk, in fact, it predicts t' WwW 
version does not. th 
with little or no decrease, 


Other 
i s. 

would have the more symmetrical er Hone le ie 
; Ay be given in terms of Observers Operating wit otratedLdnsink 
IDE <n El a ‘runs’ criterion if discrimination ene 1৩% Shichi be 
টে j br may have an interesting Physiological interpretatio 

a trial. 

discussed elsewhere. 
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correc 
Planation of the difference ro নন 
scriminations which demands inv 


en" 

. ? or ‘ conc 

This concerns the effects of ‘ attention 
Variation of th: 


te tO 
Correct and need Ne ন now 
1 
theoretical heterogeneity Of the type discussed above. In fact, the 
indications from Physiological Work (e.g. Fu 
1967; Wi 


: Haider, 
ster, 1958; Davis, 1964; Haice’ 
t something of this kin 


uld like to t 
Very useful criticis 
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HE RE 
ee eo BETWEEN DECISION TIME AND RELATIVE 
NSE FREQUENCY IN A BLUE-GREEN DISCRIMINATION 


By R. J. AupLey and ANNE MERCER 
University College London 


nation, decision times were 


Usi 5 
sing a two-choice blue-green discrimi 
frequencies of approximately 


me: রি yj 

EE i De yielding relative response 

support ie an values close to zero and unity. The results were shown to 

choice. It i iction of a so-called accumulator, or recruitment, model of 

assuming that her shown that the results could be well accounted for by 

vary in qu at the decisions were based on fixed samples of information which 
ality. 


1. INTRODUCTION 


ene fo Pike (1965) subsumed some models of choice in a simple general 
Sized that in DH: Fy OETEETS facilitate comparisons among them. They hypothe- 
in time VELEN Sl choice situation, discrete units of information arriving randomly 
Assumed that stored according to the alternative they favoured. It was further 
Certain s on t An. overt choice occurred when the contents of the stores satisfied 
(1) the ee conditions. Three models were given particular attention: 
1s made (1 Ha soon as one store receives k units the appropriate response 

€ same X ’erge’s (1962) recruitment model); (2) accumulator with inhibition— 
clears all ecision rule applies, but the arrival of a unit of excitation in one store 
TS all other stores (Audley’s (1960) ‘runs’ model); (3) random walk—a 


Shine fs 
ice is 
made when the contents of a store 


Number of uni 

sche units k (a simplified analogue © L 
} j Ri nsidered i a ye for elojee by Stone (1960) and Laming (1968)). 
ed oT out a mathematical treatmen' 
tequenc that the relation between the tim 
d Y with which that choice was ma 


icted fe 

ৰ ৰ : 
Hculgy কি for the three models (shown in Fig. 1). . 
to deci se EOL situation they envisaged was one in 


im es each of a series of trials, which of two stimuli Ww ree = 
telatiye pf stimuli appearing with equal frequency in 8 random A 
the diff requency of the alternative responses Was to be manipulated y Vary 
biases culty of the discrimination. In this manner it Was hoped to avo ত 
n favour of one of the alternatives aS happen, for example, when variation. 


In th 
© relati 3 i ্‌্ 
Mor. ative frequency of a response are achieved by ৰল 
ries in the 


hd 
Proce tequent. For one thing, the asymmet bp 
& trent pe would usually be ovepulated to handle this latter চপা! 
Values for the different responses) considerably diminis bias 


ces b 
et ন 
Ween the predictions from the alternative models. 
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The 


R. J. Audley and Anne Mercer 
184 


0 0.5 


1-0 
P ve 
Jativ' 
বিছি Hl the re 55 
FiGuRE 1, Theoretical Prediction of relation between decision time En = Runs’ 
frequency of a Tesponse #52). = = = Accumulator; ERE 
TER = Random walk, 


18 

ৰ f a 
Into Account), but also the excitation might arrive at nt t 
Point needs fu oat: 

state that the results of the prese 


hp a 
nt experim, d 
effect to be postulated. ine af 
The first author has been i elation between ted 8 if 
relative frequency for individual ese data will be IPT ber 
sequently. But it was thought Obtain results from a n 


Vestigatin gy this r 
Subjects and th, 
desirable to 
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individuals in such a way that they could be meaningfully averaged to throw 
light upon the general form of the relation in question. The experiment 
Teported here was designed to obtain decision times for a few strategically 
selected relative frequencies of response. Those chosen were 0-10, 0:50 and 
0-90, together with some observations on responses with relative frequencies 
Mose to zero and unity. By confining attention to these selected points it is 
Possible to average subjects’ times and still obtain theoretically interesting 
empirical results. Furthermore, by expressing an individual’s times relative to 
that he takes for the response with a relative frequency of 0-50, it is possible to 
eliminate any effects due to differences in overall rate of responding. 
he discrimination of colours was chosen as the setting for the experiment 
ecause it was hoped that the subject would bring to such a task already 
established and relatively stable perceptual templates for labelling the hue of a 
oN, . This meant that there would be a greater chance of him being able 
tives Ee each member of a whole range of different stimuli into two alterna- 
re an could ‘be achieved if he had been required to dichotomize some other 
pi sion (say, Intensity) which would have demanded considerable pretraining 
OF to the experiment proper. 
‘The blue-green range was chosen because it is supposed to be relatively 
tial Of stimulus anchors (Lenneberg, 1957). It was therefore hoped that each 
Would require a genuine categorization from the subject and would not 


Merely iny oye the re-labelling of a number of relatively identifiable hues as being 
Ue or green. 


free 


2. EXPERIMENT 
M 


require The Subjects were presented with hues in the blue-green range and 

epressin 0 decide whether these were blue or green and to indicate their choice by 

্থ Reser one of two appropriately labelled morse keys. The time elapsing between 
Variable A of the stimulus and a key being depressed was the main dependent 
ment. alf of the subjects were also required to rate their confidence in each Judge- 
the remalt Order to ensure that this did not in itself affect the pattern of decision times, 

lee Subjects were not required to make this additional rating. 
পি Breen hues of seven different types were presented, the actual stimulus values 
Pparately gauged for each subject. 
10%, 500 © of the mixtures were chosen so that a subject would produce respectively 
(e.g, and 90%, blue responses. This was achieved by using a ‘ staircase ’ procedure 
Produce Wetherill & Levitt, 1964). In the 90% condition, for example, if the subject 
Occasioy the Consecutive blue responses to presentations of this type, then on the next 
cver pro Ei Stimulus mixture was set one step closer to green; if a green response was 
Which will ced, 4 step was made towards blue. This procedure leads to a series of stimuli 
for + otf Pllize at a point where on 50% of the trials six blue responses are made and 
tesponses REESE there are fewer than six. If P is the relative frequency of blue 
‘wo of 1s stabilization point, then PS=0-5 and therefore P= 0.89. 

expecte, dE e mixtures were set farther out towards the extremes so that they would be 
even yy yield virtually 100% consistent decisions. The remaining two were put at 
“Teme values. The seven conditions will be labelled 0, 0, 10, 50, 90, 100, 
ely. The stimulus conditions were presented in random order to the 


being 
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Rt 10, 50 
bject in five blocks | i i i f conditions 
sul ks of 32 trials. Each block contained eight trials 0! 
Uo : d two 5 of each of the remaining conditions. 
and 90 an 


ec en undergr en! iversi London. 
f Universi College 

jects "TI dergraduate students 0° 

Subj ty 


ৰ blue 
5 fitted with a Ea 
ত Pickford, 1949) urce 0! 
i d-Nicholson anomaloscope ( S 5d HEHE 4b 
EE oa) and a green (wavelength 520 Tp) Ww mounted on the el 
filter SRE fee A 12 in. diameter Perspex wheel w যা be obtained betw' 
EO KetON Tn i ercontrol of the anomaloscope so that 78 steps cou 
of the hue-mixin: 


rn] 
ay from 2 
range. ‘The subject sat 6 ft. ing ght ্ঞ 
f ৰ A warn d the 
in. in di i Wn hardboard screen. ale 
aperture 0°7 in. in diameter cut in a i SNE ES Ee contol ্ণ 
i subject's respor r in 
i f the stimulus which was terminated by the subje Hy experimente Rf 
presentation 0: 1 was carried out manually by ar f the stim 
den from him. The presentation 0 


these 
0 ision and 

© morse-type keys to signal ন was also 
ndex fingers, respectively. A microp 


ents 
judgem 1 
in the adjoining room to note CON te subject's 
ৰ i Tr 
of confidence required from Some subjects, This experimenter also een 
atron timer to the nearest 10 milli 
As subjects varied 


an 
icular hue, £ 
Breatly in their naming of any particu 5 
approximate 50%, point for each subj itions were! 
settings of the stimulus. ‘The starting values for the 10%, and 90% conditi 
at three arbitrary scale units from thi 
Were placed three further 


Units towar 
four or eight units farther 


int was towar 
Out. (For five Subjects, their most extreme point ing 
€ green; for the others it was towards the blue ) The warn 
imuli were Presented automatically at a rate of one every 14 sec 
Signal came on briefly and Was followed 1 Sec. later by the stimulus 
Ubjects were instructed that they h: 
trial was 


3. Resurrs fora a 
The data were analysed in two Ways. Tn the first, all the data 
condition were Used to Calculat, 
only data used Wer 


2 proportion of correct Tesponses 
was hardly Any difference in 
frequency obtaine 


d by the two 
based on the first. There Were 


ne 
co iM 
his od 
Was Standardized by expressin it as a ratio tO ot! d 
iti i 6p ol 
in condition 0-50. ‘These ratios Were then ee Ls Fig. 2 they ar resho 
ical curve for the a. 


= t 
ccumulator model (with the 
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Value k=2) which describes the data quite well, although the empirical decision 
time for correct probability is rather higher than that theoretically expected 
(the monotonic trend in the ratios is significant with P<0:-001 using Page’s L 
test). For the accumulator model, the expected decision time ratios for the 10 
and 90 conditions are: for k=2, 1-05 and 0-91; for R=3, 1-06 and 0°89; and for 
k=10, 1.05 and 0.90. The obtained values for each subject are shown in Table 
bs and they have mean values of 1.16 and 0°88 respectively. ‘These values are 
fot significantly different from those predicted from the accumulator model with 
0 = (t=1.79, P>0.10; and t=0.94, P>0:25, respectively). ‘The ratio for 
ondition 10 is, however, significantly larger than that predicted from the runs 
no with k=2 (t=2-90, P< 0.001) and therefore a fortiori than the theoretical 

Ue for the random walk model. For condition 90, the observed ratio does not 

Ver significantly from that expected on the runs model (t=0-37, P> 0-25). 
Diese bd analysis of the response time data for relative frequencies near 100% 
necti nts Special problems which will be discussed more fully elsewhere in con- 

lon with the results of another experiment. 

fe US is nothing of special interest to report about the confidence Judge- 
time S except that their presence did not unduly influence the subject's decision 
tiie For condition 10, both groups of subjects gave the same average decision 
atio; and for condition 90, the average ratios were 0-90 for those subjects 

Juested to give confidence ratings and 0-86 for the remainder. 


Theoretical 
intercepts 


0 0-5 1-0 
Ficuns 2 5 P | 
্‌ bserved values of decision time ratios plotted together with theoretical 
prediction from accumulator model. 
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ECTS 
TABLE 1. Decision TIME Ratios FoR INDIVIDUAL SUBJEC 


Observed response Decision time 
Subject Probability ratio (L/L so) 
N.0. 0-125 0:99 
0-875 0:86 
L.B. 0-190 1-19 
0.810 0-98 
A.B. 0-125 1-19 
0.875 0:99 
ZK. 0.140 1-23 
0-860 0-75 
C.G. 0.125 1-16 
0-875 0:93 
D.P. 0.138 0-88 
0.865 0.74 
GM. 0.180 1-22 
0-825 0-97 
M.R. 0.125 1-04 
0.875 0-98 
AP, 0.125 1:62 
0875 0-64 
LP, 0-190 1:00 
0.810 0-85 
4. Discussrox clation 
‘The technique employed here has yielded clear evidence on the r 0 
between decision time and relati 
and Conditi. 


a or the 
Ve response frequency. The times for t 


mes fOr $87 
ly yielded some fast times t these 
iso re incompatible with the model. tified 2° 
Re (ENTE easy discriminations have recently been iden tudie 


3 Yellott, 1967). Alth they need to be $° ost 
He Fe Ee HOt suffi ntly fede in the ও t LS eriment to sug 
at we should modify ou conclusior. Present exp 
In Calculatin 0 3 


SErVve, deci; d that Hh 
A fk CISION tiie rapioe assumed 
total time that the Subject took in making pj © tl0S, it Was table entire Se 
an underlying decision rocess, Tn f § fis choice was attributa ন 
contribution from ৰ 


ম্‌ side e 
othe Ore Act, there is likely to be a cons!” rh 
3 a 5 
effect of taking this into i 


5 imes- CY 
“8 Peripheral conduction eta i 
between the data d th stout would 0) ene slightly the discreh gol 
etwee bs t € theoreti redicti mulator i008 
However, these Predictions hay een base ions from the AA assum oral 
about the Input to the decision. 1, 4 g mec ere On rather speci টা of a Egg 
interpretation of the model is a Tace betwee, পক bye it tly W® 
Systems indep 
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ক a threshold level of excitation (see also Gleitman & Caplin, 1968). 
Tate de change in the assumptions about the input could alter the pre- 
ph On time ratios in the direction of those observed. 

enquiry SEE for the accumulator model now changes the direction of the 
rather RA only is the model capable of several interpretations, there isa 
On the trie approach altogether which yields a similar theoretical result. 
random walle Ent for example, Audley & Pike (1965) pointed out that a 
with houa গুলি model (the optimal statistical decision-making procedure) 
Especially in Ab babes shrink in time has the properties of the accumulator. 
Ole tighter Se Hr estnt situation which involved extremely ambiguous stimuli, 
concerning tle ls HTT passes, subjects would rapidly lower their criterion 
tial statistical টে eS idence to be treated as decisive. Indeed, if the sequen- 
Such shrinka a 1S to describe successfully the present kind of data, then 
expected aL rt ia boundaries is required, for otherwise subjects would be 
his eran Su 0 express the same degree of confidence in their judgements. 
Panies the A oe is happen; anda poorer accuracy in performance accams= 
ctectabilicy a idence ratings—the Very reason why one may use the signal 
Possible the th Fr to describe this kind of discrimination data.t But itis also 
to trial Varatb, LE of the present results is due to a considerable degree of trial 
Tete bes in the quality of the information that the subject is processing. 
a limited sam Es that on the presentation of a stimulus the subject receives 
On some trials io he the usefulness of which varies from trial to trial: 
quickly abe to d cellent information is available and the subject is readily and 
falls; On: othe etermine on which side of the blue-green mid-point the stimulus 
Which in oie Poor information will demand a lengthy measurement process 
Statistics itis ne leads to a wrong choice. If attention is limited to simple order 
wets, Ta to use an extension of the signal detectability model (Green 

Let PB ) to derive predictions for such a model. C f 
toportion BR a stimulus which on its presentation elicits from the subject a 
Teccjyed ঠ 9 blue responses. Suppose that the samples of information 
dimens;io; 8 hue is presented can be represented as varying in some single 
bility theo, quality—i.e. akin to the likelihood ratio abscissa of signal detecta- 

hig a Fig. 3 shows how certain order statistics can be derived. 

a P=y.5g shows the distribution of samples expected from three hues: 
fs Sy blue-green; and two other hues which produce response pro- 
nor Metrically above and below this. The distributions are supposed 
that the mal with equal variance, here set equal to unity. It is also assumed 
Which a Fes of the P=0.50 distribution is the position of the criterion above 
Ample of information will always be called blue by the subject and below 


In this f 
iS hi recently CuO it is of interest that the most highly motivated subject was an art EUG 
Accu ance to him Re a study in blues and Ercens! Every decision was of great professional 
form lator model anc one might expect him to maintain a high criterion over time; but the 
2 €! still fitted his data well and there was no tendency towards the random walk 


N 
S.P. 
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(1-P) - Blue 0.-5- Blue P-Blue 


ct 
| corre 
n of 

€ hue discrimination task. Proportio 


incorrect 
} f in ক 
time above Ms5o=A/P. Proportion 0 
responses with decision tim. 


ne of i 
ন Ing = 4: 
e above Mio= B/(1—-P). Horizontal Be Vertical 
0° ol 
Proportion of responses which are Correct and longer dE than Miso 
hatching = B = Proportion of Tesponses which are incorrect an 


Which it will be cal 


lled green, Consider the | 
to the 0.5 


5 
mpanie 
—Blue stimulus. Obviously, the median latency (Ms;o) acco 


d on poorer 
0 i 

: are both called blue and are base 

information than that requir; 


ecision 
quiring time M,, to Process, and hence have d 
times longer than Ms, is area is 
[oe] 
A=P-_ exp( — 22/2)dz, 
0:68—a'/2 
Where d’ is the di, 


bu 
distri 
mean of the P_ and (1- P)-Blue 1 


exp( — 22 [2)dz. 
12 
Therefore the 


.p have 
্‌ ch 
I VE Tesponses to this stimulus whi onses © 
Ber than Mis 4 PR imilarly the Proportion of blue resp {Ph 
Ave decisi. 


On times longer than M. so is B AK 
B= (1 i , 2) | 2 
= Az. 
008 + a2 exp(— 22/2)dz 
roportions as 


P varies i 
data points for P=0.9 bts; 
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variabili | 

ne ee could account for the kind of data obtained in the present 

Proportion of ৷ predictions within a given condition, e.g. the 

model is Synch ৰ longer than the median correct time, the fit of the 

he form ih Etter. h 

assumed Shapes. oF বত ক curve in Fig. 3 is obviously dependent on the 
Omas has very ki he distributions of stimulus information. Dr E. A.C. 

Of Laplace and Y kindly looked at the predictions following from the assumption 

OF response নাগক distributions. In the former case, the curve is horizontal 

monotonically Hb ilities lying between zero and 0°53, and it then decreases 

Or low respon or the rectangular case, there is a short horizontal segment 

Ponse probabilities and then a monotonic decrease. 


Proportion of latencies above Mso 


EE 
0 ছক ড় 
0-2 0-4 0-6 0-8 
Response probability 


1-0 


Ficv 
RE 4. Latency predictions from statistical model.  @ = Average. 


A 


ful i হি y 
Broviqe ler discussion of the relative merits of the explanation of these data 


Y the accumulator model and rival hypotheses requires further 
hich will be presented elsewhere. But the results of the present 
and others which will be subsequently reported (see especially 
preg A that to a first approximation, the accumulator provides a 
Criminat; ion of the relation between latency and probability in a two-choice 

tion task in which the alternative stimuli appear equally frequently. 


a! 
© ence জা 
sPeriment 
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DISCRIMINATIVE BINARY PREDICTION WITH 
REINFORCED CUE IDENTIFICATION 


By DoMINIC W. MaAssARO 
University of California, San Diego 


and JoHN W. MOooRE 


University of Massachusetts, Amherst 


ET tones, Ti and T's, differing in loudness, were employed as ME চর a 
events ee binary prediction task. Two responses, A and sR ic AE 
Was c দ and Es respectively, were available to the subjects. An 4 
pre. Correct with probability sr, and 2 on Ti; and Ts trials, respectively. F 
Sent study assessed the effects of the covariation of Ts and the similarity lo) 
ern Ton PAT), the probability of an A, response givena T; trial. He 
Fesdhe oS required to identify the trial type (loud or soft) and were aR 
the ee ‘ of identification responding. Partial feedback was suet to elimi টে 
identigce of cue similarity on discriminative event prediction and also impre 
cation of trial types. ‘The reinforcement effects indicated that subjects 


ea ; 
m ‘to behave appropriately to identified cues rather than learn to make specific 
Otor responses, 


1. INTRODUCTION . 

He experimental situation of discriminative binary prediction consists ofa 

yt 7 Es each of which commences with the onset of ue by ০) 

Wo Ia 10r T. The probability of Ti; is B, and the Probeoi 0 te 

to th Sponses, 4, and A4;, predicting events E; and Es respective'y, are য 

Subject. Ona Ti; trial, an A; response is correct with probability aj, An 

coe POSE is correct with probability 1 — m1. On a T's trial, en 

ly with Probability 2, and an As response 1s correct with Pp 


Series 
Sti 


A previ i 968) has shown that P(AilT'), 
th Vious experiment by Massaro et al. (1968) 
jn Probability of an A sveponse givena Ti, trial, is dependent upon BE 
0.5 “HC Similarity between T, and Ts. With values of nn Te rE 
OF 02, the results indicated that PALIT) was: (1) a linear UNC 


Wh, 

inten bl Cues were highly similar, (2) independent of Ts and ye a 
ou s diate level of cue similarity, and a U-shaped function bi he 
Simil Were highly discriminable. In that study, in order to de on ho 
two Ny between T', and T's, the subjects were required to Ee wl FLY 
fy Les (loud or soft tones) was being presented before no ee ঃ 
Biven | event that would occur on that trial. However, the su 1 I 

feedback regarding the accuracy of their identification of trial type. 
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present experiment is an assessment of the effects of variations of rs» and cue 
similarity on P(A; |T',) with partial feedback (25 or 75 per cent) about identification 
responses. 


2. METHOD 


Subjects The subjects were 144 University of Massachusetts undergraduates and they 
were assigned randomly to the experimental treatments 


Apparatus Up to four subjects were run at a time, each seated at a tabletop enclosure 
containing an Estes Straughan conditioning board, which consisted of a white centre warning 
light and two red event lights, each positioned above a spring-loaded lever switch. The 
centre of the board contained a 2x45 in. panel recessed 1°5 in. with two spring-loade 
buttons mounted one above the other and labelled loud and soft: Tones were generate 
by a Hewlett-Packard Model 200 audio-oscillator and were | resented over matche 
headphones with a continuous white masking noise. Experimental events were controlled 
by Lehigh Valley 1620 Probability Randomizers, Hunter Interval Timers, and relays. 
Events and responses were recorded on an Esterline~Angus EVERY recorder, 


roced Hl 
EE TEE Ae Ho a trial. The tone lasted 2°5 sec., during whi 
respective button. At the nd € a loud or soft identification response by pressing 
on feedback trials On both of the tone, the correct button was illuminated fo; 0 
illuminated 0-9 ae0. afte. Ee feedback and no feedback trials, the warning light 
Hade-thelr nkedisfton t tone offset. The light was left on for 1°5 sec. and the subjec, 
following the end of REE The event light was illuminated for 0-9 sec. immediate y 
interval was 4:2 sec En Sh abe light. . Hence each trial lasted 5°8 sec. The inter-trid 
events were presented ind rn] ject received 300 trials in which all stimuli, feedback চে 
Subjects were give ৰ Y by an appropriate setting on the probability randomizer, 
differing slightly irr lo the following instructions: ‘You will be receiving two ton 
of the tones you are i ness over the headphones. Your first task will be to indicate ba 
You will press the to না to. You will do this by pressing one of the two bE 
Notice the buttons ন oe aba for the louder tone and the bottom button for the softer a 
sure of your decision ll সু elle loud and soft. You are expected to guess if you are 6 
will go off and one of th et will have 2% sec. to make your choice. After 2} sec.» the or 
did come on. To imediatel Paton ba come on, indicating whether the loud or soft a . 
Your second task is to pr. yf after this, the white light on top of your panel will 80 vill 
come on. As soon a $ EE of the two red lights at the bottom of your pane He 
two switches. After RS Ee the white light comes on, you are to flip one Gf the 
red light above the switch ave flipped a switch, one of the two red lights will show. 3 
on, you were incorrect an that, EN comes on, you were correct. If the other rr oro 
you have been correct on the ee you will always be told whether OF ion 
regarding your indication of a loud or gofe i will not always be given ™ ° 
; tingent 


Design Three 1 los 
oan onn obi ad te bs crossed with three stimulus, OE were 
edback conditions. nditi BL 
3 and Tir Seg heen Dain’ of ne cu SOO i tones er Tea 
K » Biving a 1°5, 3 and 6 db intensity differential (AJ) respey fl e: 
for three decreasing levels of cue similarity. The three ifferenti edules % wer, 
(a) Tri = P(E;| Ti) =0°8 for all groups, and 0) a= P(E F005 apd চট) respectivein 
call ro io 2 rca and group’ 0:8-0:2, Thus, there were eight subj ec pe 
each cell of a. etween-groups factorial desi subjects. ack, 
analysis of variance of P(di)’s included the three nae t: Arr and feed ck 
and the three within variables of cue (T; vs. Ts), feedback trial (yes or no) and trial 


(three blocks of 100 trials). 


ch the 
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~P(4, IT. 

bet hs hati 180-21; d.f.=1, 126; P<0-001), increased as the difference 

On Performa Ts increased (F=8143; d.f.=2, 126; P < 0-001). Discrimina- 
nce also increased over training (F=1956; d.f.=2, 252) and 

and Ts differences (F=7-00; df. 

d their 15 higher order inter- 

hat cue similarity did 


ti 


Increas 

ed 

=4, 252; er over training for larger 

ৰ 5 & TT" 

Actions were ১01). | I he cue x AI interaction an 
tot disrupt 4 Statistically significant. This finding t 
Partial EMEA ee performance in any significant Way indicates that 
was sufficient to eliminate the effects of cue similarity found in 


Previo 
US studies (Massaro et al., 1968). 


0.9 


>) 
~ 


টী 
Marginal statisti L 3. RESULTS 
istics Fig. 1 shows that discrimination performance, P(AilT;) 


[= ) 
হু 
\ 


° 
Cl 


un 
Fd 
Fr 
[- 
J [=] 
es £ 
Fi 
ভু 
a) ত 
50.5 
c 
S 
5 0-4 
[= 
2) 
5 A 
0:3 হা 
5 L 
>] ~ 
= 
2 
২ 0.1 


0-2 


UR 1 2 
নয 1 Mi Blocks of trials ্‌ি E et j 
ঠ i jals as a function 0 Trial 
blocks ean proportion of Ai responses on T; and T's tria bs of 
nda ——py PTD =~ == \ 7 5.80.8 group; 1, 08-05; 

A, 0g oo, , P(AlT); PAT); [9) 


Fi 

ds i also shows that, overall, P(AAilTy) We 
when » 265 P <0-001). The fact Fat PIT) was about 5 per cent lon 
er than when 2=0°8 or 0-2 seem! indicate that a dis ive 
pres S with little or no cue value (i.e. oF reinforced on # he asis) 
bn Rl P(4,|T;) relative to a stimulus with . This finding agrees 
udies where the trial types were highly discriminable (Popper & Atkinson, 


ex 5 
» except that the overshooting reported in 


jon of Ts (F= 172°7; 
r cent lower 


not found in the 
epression 0 Id be due to the 


he present stu y. For example, 


Prese. 
was also dependent 


rand 
€ 


nt 
experiment. This overall d 


om l 
ness of the occurrence of feedback in t 
d that event 


a trial. 


subi f 
Jects may have hypothesize OCCU sthesins 


Ww 
hether feedback was given on 
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TABLE 1. OBSERVED AND PREDICTED VALUES OF P(A, niilTiniaT sn 
FOR GROUP 0-8-0-2 


f লে ks c condition 
(‘ No feedback’ trials; trial type correctly identified for the 25 per cent feedback c 
Pooled over levels of AI and trial blocks.) 


A; nEun) 


Frequency Proportion 
ED dicted 

Tint Tn Akn Ey Observed Predicted Observed Bee ন 

J J 1 365 380-9 0-867 0 

1 1 “ 84 89-0 0-737 a: 

1 J | 2 1 59 52-1 0-634 iE 

; R 4 17 154 0436 

1 2 y 10 | 0 5400 4 

1 2 If 3 63 61-0 0-578 0746 

1 ; 5 : 93 91-7 0-756 0 

b দ i 2 430 433-4 0-898 ne 

2 1 1 1 42 45-1 0-089 ss 4 

2 j 1 2 20 34:8 0-146 0440 

2 1 7 1 50 57-2 0:385 0.605 

2 1 2 ঠি 15 20-0 0-455 টি 

2 ঠ 1 1 14 13-3 0-636 f 440 

: 2 2 ন 44 47-1 0411 254 

2 2 2 1 30 28-5 0-268 0.095 

| a f 50 45:6 0-104 
€=0:314; €2= 0-334; X:=11-20. 


Brn) 
TABLE 2. ‘OBSERVED AND PREDICTED VALUES OF P(A, unalTin Tanda 

FOR GROUP 0.80. | 
(Feedback trials; trial type corr কচ 


! dition 
ectly identified for the 75 per cent feedback con 


Pooled over levels of AI and trial blocks.) 


Frequency Proportion ] 
7 cote 
oe EE Lt Ga Ed Era Beet 
1 1 { 1 623 599-7 0-941 0-711 
! 1 2 ন 107 10460738 GT 
j 1 5 74 63-9 0:692 039% 
1 2 bk * 12 8:7 0-546 ay 
2 1 2 19 10-6 0-704 0507 
i 4 ‘040 | 
: EFM ff Uh 
2 1 f 2 415 412.2 0:912 ooo 
£ 1 1 5 57 54-8 0-098 0°289 
2 1 2 1 28 390 0207 9403 
2 1 2 2 25 38-3 0-263 0-606 
3 2 1 17 20-0 0:515 0-606 
2 2 1 5 21 21.8 0-583 0-403 
2 - 2 1 41 47-5 0-348 0-289 
oh 2 2 2 £ 35.2 0-377 0-09 } 

; - -089 
€=0:383; 0:=0-413 50-6 0-0. 


3 X*=76.85, 


PAPE 
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The se ঠৰ 

model কপ যা * in Table 1 agree with the predictions of the 
4 € subject making an appro yiate Tes i 

ial types n and n--1 are identified as the St Ee eS NEANLEES 
Mees P(A, |A,E;) > P(A, AEs) > P(AilA2E1) > P(AilAsE), 
লা 5 en the trial types are identified as different, the rank-ordering is 
dos LE ‘Table 2 shows that reinforcement of identification responses 
ag for the ন this mode of response, since the rank-orderings are the same 
TE s without feedback. These results support the thesis that, in 
ldentifed e On learning, subjects are reinforced for behaving appropriately to 

ues rather than for making specific motor responses. 


4. CONCLUDING REMARKS 

forcement schedule in operation for one 
ther trial type. A stimulus with some 
ding to the other stimulus compared 
forced on a 0-5-0-5 basis). When 
tely (i.e. predicting the most 
likely to behave appropriately 


trial i J -"জ learning, the rein 

cue validit es Le responding to the 0 

0a Sit enhances appropriate respon 

SLR ঢু with no cue value (i.e. one rein c 

requent Eh reinforced for behaving appropria 

on the lone the trial type) he will be more 
g trial. 
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ঃ cL nted below are the expressions for the condit 
Du|Tin+i TinAv,nEtn): 
A+pUfcat(l-¢ )p} 
PAT TAE,) = £ ( 


= U(1=72)0 
PAITTALE) = Af{cpt+(l ন) (CE 


ional statistics of the 


ict J Y tification 
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STRUCTTR:, 
CERT PROPERTIES AND MEASUREMENT THEORY OF 
SETS ADMITTING A CONCATENATION OPERATION 


By WALTER KRISTOF 


Educational Testing Service, Princeton, N.]J. 


a asHcn EE erin a contribution to the theory of unidimensional scaling based on 
Ordered sets Ve operation defined for pairs of objects from certain continuous and 
aid commute a Inferences about structural properties of MW, such as IEEE 
entirely BEA] of the concatenation operation, are obtained by লা 0! i 
throughout aL non-metric treatment. It is shown that reflexivity ho. ৰ er 
elements F Ne Just for a single element and that commutativity of two ‘ 
dent. মৰ ds either generally or in no instance. These statements are ৰ epen- 
Which fo hie study employs concatenation operations defined in an El gr সে 
that of c mally and necessarily satisfy the reflexivity requirement and then a 
Scales ee ETE By this means, the existence and essential uniqueness 

establish Hl with linear expressions for the various concatenation oe nl 
these ex ed. Rules are derived for the way in which the parameters entering in 

pressions can change with admissible scale transformations. 


1. INTRODUCTION 


le - i . . 
forms ক Paper is intended to be a contribution to the theory underlying certain 
abou of unidimensional scaling. It is primarily concerned with inferences 

ntered in psychophysics, 


6 y 
he properties of sets M of objects which are encov 


8.6. s ্ : 
ener], ho of loudness, brightness, and the like. b 
e DUNT measurement theory which has received growl ge 
Urrent ication of the Luce & Tukey (1964) paper on conjoin! oe 
Ness the SUrement theory concentrates on establishing existence ET 
Corems for numerical representations of collections of objects yi 


Vario 
ঞ asic assumptions and so provides the theoretical bases for an ne 
S0 far * of practical scaling methods. A review of achieve Trent op 
Another Psychology is concerned, is given in Tversky’s (19 ) 

ৰ account has been given by Kristof (1967 a). ude nl FREE 
(los ap Portions of the present paper were stimulated by 
tepresent + He used the bisymmetry as 
eich is ation and uniqueness theorems for the case © 
“one udi Not necessarily reflexive and commutative, 
mai LS Section of this paper. However, the present |) ra En 
bs inquiry i h basic P 

my, LOS namely, an inquiry into suc হে 
Use tivity and nats na bershD of M (Def. 5) made entire yY 
sults Ss scale-of M. Apart from na t HELE domain 
i ul as C 
n be scaled area, R11 and R24, may be usefi 


Thus it belongs to the 
ng interest since 
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The paper has the following plan: some ba 
are given, then a study is made of the reflexivi 
and this is followed by an inquiry into commuta 
defined formal reflexive Operation of concatena 
commutativity enable another o 
commutative. On this basis, 
elements of M, i.e. constructi 
commutativity requirements, ived for 
intervals. The existence and uniqueness problems of linear scales are so ation 
different sets of basic assumptions. Attention is given to the transforn 
Properties of the numerical Parameters involved. 

The results are Obtain 


} : K s little 
ed in a strict Way, but one which demand 
mathematical knowledge. AIl necessa 


: ion 
ty of a concatenation opera 
tivity with the aid of a Le on 
tion. Some further resu 


ন h their 
nite sequences, even thoug But 


em Il at definite Structural Statements about M which can be ver 0 
Pirically From a mathematical Viewpoint 
to a certain set of is 


more Convenient I 
Let i Anguage hic 
are of interest hers " nn EL oe elements a,b,c... x, Yy,2. The sets W 
May be defined K 
Def. 1 by the following requirements 


je 
>» < and =. Here = exp 


tively’ 
M, whereas > and <, respec 


4 ৰ min 
(iii) Mis a contin De clement, “max and a minimal veo | Fr into 
a lower class 4 an ডল ‘his means that every partitioning of 7 pind 
bes Pper class Z ede 
cut, i.e. cither 4 ha সক Mau » ANZ=0, definesaD A bas 
no maximal element ang z EMS 00S es has ah an. order. 
Continuous set is Necessarily dense, i.e Hel a 1 follows say, implt 
the existence of 2eM with x>2> yy. and yeM with x>y, 
M may be regarded as Composed 
elements of Ma 


Of ordere 
S representatives of ৰঃ 


t 
nd 
quivalence classes 4 rde 
relation may be given 


0 
ilistic in, ut equivalence classes. The 
a probabilistic interpretatio A 


ith ions. 
Sic definitions and assumptions 


b) 
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bs 2 2. The sets considered here admit a concatenation operation which 
dL e performed on any two elements of M -. This means that xeM and y eM 
RM uniquely (in the sense of an equivalence relation) a third element 
which may be written as 2=X#y Or 2=%X0J), depending on the context. 
livre Pe of such concatenation operations may be halving Percept 
yeSerrB which are defined by two given stimuli, or the construction of astimu নন 
extre ing most two given stimuli. Time-order errors or tendencies to avoi 
Me responses need not be ruled out. 
RI ক basic assumptions, Cl, C2, etc., are NOW formulated. CT 
the টা etc., will be drawn later, which are aimed at improving our insight into 
uctural properties of M. | l ন ? 
My e Ne Each pair x, y of elements of M determines uniquely a third ecemen 
(existence). 
his assumption is merely a repetition of Def. 2 an 
reasons. Further, for all elements: 
Wy =y#x (commutativity, symmetry). 
4 N#x = x (reflexivity, idempotence). 
+ (vey)#(uso) = (vxu)s( yo) (bisymmetry). ar 
২ V2 iff N#) SN 4), respectively, and YZzy if y= 


d has been included for 
formal 


x#)' (mono- 


s ith 
ry w#a>b together with y#a<b implies the existence bs hes. ho 
ue) vig nilaly, a#u>b, together with ax¥v <, S 
is cantly cee in C6 are 
ile 1s easily seen with the help of C5 that the elements 2 and w in 
€ in the sense of ট Tati j 
an equivalence relation. cciie 
rst 2 holds, then the econ parts of C5 and C6 follow তন | টা % 
ADDL Parts of these assumptions. This is not the case if C2 is 
: ben ens he validity of Cl to 
, to as been shown previously (Kristof, 1967 he সে phe ক 
Scale ether with the present definition of M, imp ies Re ) yell 


implies tht 


২ jp nerical representation), s(2), of M such that, ক বা hss 2 
operat, 1%) =5()), respectively, and s(x#y) =3 {s() +s 2 i x and Y '. 

Ti may be aptly termed ‘ halving the interval DE the concatena- 
tion however, Deither C2:norC3-areé expressly required, t পণ ফল । as 
be ton will be denoted by the symbol 0 লী” eng OF whether * Tt 
0 is; y al ’ p zagl S 

is that Def. 1, C1, C4, C5 and C6 sa to those in bt রা 


In yu ! 
(959 4 Se. The relations of the foregoing # UC pere is a certai 
over ) Work were outlined in the introduction, an 

P with his work in the results of Section 8. 


TY k 
I 3 3. A DiscussioN OF REFLEXIVY joate whether definite 
State, 1S the ion tO investiga can be made 
mer Purpose of the present Sector i, operation © 
aten 


৯0) ou 
Oncerning reflexivity of a cone 
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is wholly 
i rhether C3 is w 
basis of Def. 1, C1 C4, C5 and C6, i.e. whether 
the basis + 1, Cl, 
Ph of these assumptions. 


C3 on the remaining assumptions (e 
lation of such dependence ( 
data must satisfy in order 


only one of two distinct alternatives. 
Rl. (1) Let a and b be 
x satisfying aox <b. Then 


an 
in 


1 very 
proofs of which are #8 { 


‘<x and X 
ail. Suppose x RR to 
C5 then hence x’e4. Now LG TG Fant from Nth 
assertion that 4 Possesse imal element, ", that is di “lement : nt # 
<b<ao xmax. C6 ensures the existence of tb t an elemer™ 
aoz=b and % <2 < xmax. The density of M Buarantees t © ontisdiots 
exists with গাধা <ৰ and, by 5, 202’ <aoz=b. This gs ven" 
maximality Assumption of %", hence the assertion is true. a ফর, Foes 
LED Let 4 be the set of all elements x satisfying x vapid Ei gi 
Then +e and ¥' <x implies x’ eA urther, if 4 is not Ee follow! tn. 
max, i imal element. Similarly, t >b, fi no 
qh edt (2) Let Bb | elements y satisfying ia if it does 
henyeB and y > y YESSY EB, Farther, i Be not empty an 
contain yyy, t AS no minimal 


Mot contain y 


Se 


d 
n 
n down into several steps. 1b # 
<% a! x4. B C5 

“eA by definition. y 


চট 
xima 
(i) Assume tb On, that 4 possesses a ma 
¥’, with 8 <tmax. ‘Thus ¥" 0x" <b 
(iii) Consider the 


1 Set S of aj elements s sat! 
not empty since x” eS 


by ii). urth 
suppose that Xmax eS, i 


omen 
i 
g fixed. ly 
isfying x"0s<b, x" fi ame ty 
EE, SG 
6, x" 2 


in Xmax* ing 
0es not contain A x" bet ith 
nw O Nmayx <b. Ut xmax O Xmax 2b due 
BREN A “max <b SXmax O may. 
on 
UO Xmax =b and, by C5, x" < 


uh 
ent 2 
By C6, there is an re uo fe iw 
> SAmax. By‘GS, ou UO xmax =b, ossible ss the 
If the sign = holds, then 4 1s chosen so that 3° <y’ <u which is p ic 
the density of M. ence, by 65, 7 Ou <p 


ৰ ntra 
In any event, this co 
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TE that x” is a maximum, hence S cannot contain xmax. Therefore, 
CINg to RI, S has no maximal element. 
Ne Consequently, there is an element sin S with মৎ and #o0s' <b. 
similar ia er the set T of all elements t satisfying tos'<b,s fixed. tb Reasoning 
becaus 2 (iii) shows that T has no maximal element and neither is it empty 
ER ET 
F be (Then, however, there is an element t' with “<r and t 0’ <b. চহ 
Case, fro Vr one of the elements $’ and t' or take = ifs t [] An 
a <৯ না Ne follows that #0 # <b hence #€A. Buta" <s and <t Cr ies 
fore 4 he Again contradicts the assumption of x" being a maximum. ere- 
AS no maximal element. Q.E.D. 

If, for b given, there are elements x and y with xox <b<yoy, then 
Unique element 2 with 202=b and x <2 <y. EL: 
roof. ‘The sets A and B of all elements x’ and y’ satisfying * 0% ও, 
contain >. oY', respectively are, by assumption, not empty. Br 02: el a 
an max and xmin, respectively. ‘Then by R2, x" eA and x" <% imp y 

‘ina Other hand, y eB and y">y' imply y"eB. Further, by R2, A has no 


Maxi fs 
ler clement and B has no minimal element. Now suppose oR পণ 
Nn ren With 202=b. ‘Then this yields a partitioning Me St (i) 
of D Which is not a Dedekind cut and hence is a contradiction ছ E 3) 
C5 Si ডা Onsequently, an element 2 with 20 2=b ক | EE GE 
fess, Ge <y. C5 also ensures 2'=z if # 0#'=20% which p 
he follo i g i 
Wing definition is now introduced. A erillbe 
Ih h 2 and it W! 
Tegarqeg ন (2), 2 eM, will be used to denote the element #0 


a function which maps M into (or on to) itself. 


there is a 


Re Beneral fixed point theorem is now dered. i.e. o(z) possesses at 
least One « here is at least one element x with o(®)=%, Le. 
OP ed point’. f d point. It follows that, 
nn Bane of Ssume to the contrary that there is no fixed p A are elements 
uy and ola, o(xmin) > xmin and o(xmax) < Xmax- Ee R4it will suffice 
to 0 With 0 es 
ig U< wo) < Vor uy Uh, = 
Bc etente nogy ee SRN te B h ee of elements uy, Up Ups «7 yr 
fined Pon uy. An infinite seque cll) for n=l; 22 


Sequey As follows. Set u = o(up) and un b assumption. 
c 1 y 


i . } . ely, to <1: 
elements Un is strictly increasing. A Te same line of reasoning 


it Een, =U 0 Uy <u; OU =U, hence us < : bounded (certainly by ons 
«Will orally wy <u 1. Since this sequence 18 7°. ৰং ‘smallest’ elemen 
ং er? thas 2 supremum u=sup un, Where 1 i$ b of limits, including 
Rbrem than fj un (detailed proofs concerning the ET dn al] 1 
yence চ “nd infima, are contained in Kristof (1967 a))- it is certainly the case 
hat wD »uUu0o u> = for all n. Therefore, 1 রি? শি impossible. 
tame © “> 2, reme borin the ‘definition of 4. But Be and R3 guaran- 
t mbering the I un OU ট 
রি the Ssumption forthe contrary gives uo ৰ 0 >un for all n el! 
3 *stence of an element w with u=% 0% an 5.P. 
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; =u. Substituting 
dicts uw being the supremum. Henceuou =14, i.e. olu) =u tL 
contradic! E 
4 R4. Q.ED. . . ly oe 
NY a be noted that the proof of R4 did not Re 
চি ক লক কক = of C4 (and C2). C4 will be invoke 
Vall 


x. 
and yo 
RS. If x and y are fixed points of (2), EM, so are xoy 

Estey, if x <, then ¥<x0Yy<y and x<yo X<Yy. 


y 0) 
Similarly, (J lk 
Proof. By C4, ra a edna adfmea ad x <yo# 
o(y0%)=y0x. Further, N=YOX<uOY <y 0y=y by 
analogously. Q.E.D. 
This means that the set 
more than one fixed point. 


R6. (1) Suppose thal 


if there 18 
of fixed points of (2), 2 eM, জগ 1 j 
However, density in M cannot be in gt 1,2, ); 
t uy<olu)=u,. Let eA is ‘larger 
Then u=sup uy is the ‘smallest’ fixed point of o(z), 2 eM, be Un Yr 
than uo. (2) Similarly, Suppose that v,>o(v)=v;. int of oz), 2€ 
(n=l, 2, ...). Then v=inf ©n is the ‘largest’ fixed poin 
Which is ‘smaller’ than 00. 
Proof. Only (1 


) is proved here, the proof of (2) being similar. 
from the proof of R4 that the infinite se 


i ict 
equence of elements un 2) Sot 0 El 
U=sSup w,, which is a at From 
fixed point, t, with U<d asoning $i 
1<t<u. ‘The same line of re 


tion 
This contradic 
Ver, u cannot be SUp un. 


It follows 
ly increas" 


UuOu, hence u 
Wn <U<u for all n. Then, howe 


Proves the validity of R6. Q.E.D. 


et x and y be fixed Points of ol 
element with x < 


1 
i { রং In other we 
to<y. Then y, is also a fixed point of (2). 1 s themse 
ll elements lying between ’ two ed points of 9(2) are fixed point Therefor 
Proof. () If Wy=u1, 0 uv, then there rem: 
let uy SUyOU=u, (the other 


ains nothing to Eo 
Possibility Uo>uy ou, would be han ge oe 
Casing sequence Of elements up, Uy, + 


f R4. ef 
ing defined Precisely as in the proof 0 is 
mallest ’ fixed Point of o(2) ‘la 


is no 

is ¢ isn 

Tger ’ than uy, i.e. ther @ 

en uy and ux. is a unid 

i el EY 

]। tu’ ie "i 00uU. Then, by R3, bE. <ui0 py 

Again by RS, Wet, euious and tr Se This sont ed 
Sain Dy R3, there i 2 Unique element U's with UL =u 0Us and cL 

% Ng yields an in 

elements, ... <u <u 


nilarly) 
d similar, 


» 4 is the ‘ s 
fixed point ‘ b 


en Xe 

j (ৰ ing sedqu ’ 

ite finite strictly OE ded bm, 

2<Uu Sup with uw i u'n,yp bounc OR 80 

Therefore this Sequence Possesses an infimum, uw ন un, with uw <u th? 
(iii) It will be PrOVEd Hie pe 

u<u ou’. Then the 


1 
OSC yy 
tou. To this end, SUPP” 9; 
re must be an inde 
from which we infer uw’ 


i SUM yun 
ন X m such that 1 myj1O0OUmyp1 

mii<uw by C 
Now examine the po 


ing i 2 
fe ; ’ being yy 
ibility 4’s.,  -0Wever, this contradicts u { 
ssibility u>uwouw 

, 

UW <Un=Uni0ou 141 for all 


tO god 
It would then follow kr 2 o% # 
ln, By R3, there is an element v with uw 
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U<v<u 
আলি বত আতে all n This contradicts w' being infu. Thus the onl 
(iv) ur Fe th 2 =u ou which must therefore be true, i.e. olu')=u' Jl 
Namely, ip oy চী largest fixed point of o(z) which is ‘smaller’ than u 
must be an ibd e there is another fixed point, w’, with w' <u" <u. Then th e 
uy, hence " such that u'r <u’. By C5 we obtain u'k_1 En OU uo 
পক <u Which eo He ie u Lu After k-—1 additional analogous steps we arrive 
mulated in (ivi s the initial assumption wu" <uy. Hence the assertion 
v) Therefo 1S Correct. 
Contain any fie Ul u=SUp Un and uw’ =inf w'n limit an ‘inte. 
Points. This le Ee of o(z) in its interior. However, 
Consequently cads to a contradiction to the previous statemen 
RS. If bg HE i.e. R7 is valid. Q.E.D. 
largest # and Lt 2 eM possesses more than only one fixed point then there is a 
a ‘smallest ’ one, i.e. the set of fixed points has a maximal and a 


rval ’ which does not 
u and wu’ are fixed 
t by virtue of RS. 


ie element 
Proof. (i = | 
Will be gl a detailed proof of the existence of a maximal fixed point 
» the proof involving a minimal fixed point being similar. If xmax 
It is assumed, therefore, that 


Isa 
Xed poi 
V 1 কি 
jax is A টু ০-0 the assertion is trivially valid. 
al It is Dent point and that the set of fixed points has no maximal element, 
opnente * seat i the right. Then the continuity of M requires the set of all 
der thata D os than all fixed points to possess a minimal element, to, in 
হ্‌ (ii) wz edekind cut may result. 
apidered. Ae uo by definition. First, the possibility that uy <u is 
re Un, yy nl infinite sequence of elements in is defined by setting 1) =o © Uo 
UU i) ts Uy, i.e. up.) = olun) generally. It has been already shown in the 
in PUGgsssee 2 a sequence of elements un as defined here is strictly increasing 
compatip ন supremum, u=sup un, to <, for which o(u)="1. This 1s 
of (ili) Su ith the definition of wo in view of R7 and so disproves uo < 1p © to: 
vali (2): he Pose, on the other hand, that tuo > to © to- Let t be any fixed point 
aly of he t<uy. From C5, tot<uootis, hence t <up 0 uo < io: The 
ay, SPE ton 3 Statement for all fixed points t contradicts to being the ‘ smallest 
g (3 ৰ ৈ mE 
“ren ee by all fixed points as Imp he continuity 
Th 1). 
ole all Te a contradiction is reached in either case. Consequently, the 
Ment, oe ‘greater’ than all fixed points cannot possess 2 
0 স i 
tder that i however, the set of all fixed points must have a maximal elemen' 
ন € Continui j isfied. Q-E.D. 
of _“ombini inuity assumption may be satis 
I in ©) 2 R4, R7 and R8 Jha be stated that the set of all fixed as 
126 H . 
Prec * ma 1, is not empty and forms a ‘ closed interval hs not been 
“ded, Y Possibly reduce to a single element since this case has 


de it is necessary to prove RS. 
nd strictly monot 
m, whichever ap 


licitly required by t 


efor 
€ furt 
itp + et progress can be ma 
* =I; gs Xj, «5 ny --. DE AD infinite a 


lim 
HH. bel 
n being the supremum or infimu 


plies, Xn 
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. Ee) = 
For any fixed element c, ceM, (1) lim(s, 0c) =(lim xn) 0 c and (2) lim(co ") 
lim xn). . 5 xn 2 XnL 
col Only (1) is proved, the Proof of (2) being similar. | By C5 ie g 
implies xnoczxn,10c, respectively. Therefore the infinite ৰ elements 
elements xn 0 cis in the same sense Strictly monotone as the sequéncee applies 
Xn and consequently it possesses a Supremum or an infimum, whicheve ele 
It suffices to restrict Ourselves to the case of a strictly increasing ; 5 5 > 
because the other alternative can be treated in complete analogy. T 5 ppose 
for all n. By G5, ¥0C>xn0c for all 1 and so x0 c2 sup(xn 0 c)- By C6, 
Hence xoc >Sup(xnoc)>xnoc for all Re endent 
ith SUP(%n0c)=4 0c and x>»'> Xn, x IN Bereta 
of n by C5. But this result contradicts x being sup xn. 
X 0 €=sSup(%n 0 c), which establishes RO. 0.5.D. lements 
RIO. If there are tw fixed points of o(z2), 2 eM, then all e 
S. 


ed 

h ৰ » of fixe 

! rding to R7 and R8, there is a “closed interval ements 
POE O(E) in MM. Tet @ be its minimal element and b its maximal € 


i a element 
at x0 is an element with X9<a. Now choose ar 
must be a fix. 


ite Sequence of 
=%n0xn. Herex= 
be a fixed Point i 


: by #1 
J. Xn, -.. iS defined Sl 

because 
ontra 


2 
=Xn0 
elements yy, is defined by yn thet 
1 and so, by C5, » 


sn 
06) 0 (on 

0c=(x 0%) 0 (co c)=(xn 0. 20 

<n 0 yp and Jn=mOc<coc=c which implies Jr by 


i i yj 
C5 then requires a Jement wo 


: ঠন €' 
Of c disproves the existence of an € 


alogy, there canno 


el the one gi 


t be an element Xo With Xo 
i 


fore 
.E.D. Ven previously and may there A 
R10 has the follow; : 5C5 
conceivable at this stage: (1) Bich Og, consequence. Precisely two C8°° opt 


Jee ot 
of M are fixed points of 2), ze. 1, Oly one fixed point or (2) all €¢ men 
of Mis reflexive or 4 + In oth 


elf 
ন ‘ust one € give 
(2) all element er Words: (1) Either jus e refle ন 
element exists has been established ন 2eflexive, That at i a 24 
regardless of C4. Y R4. This Jatter theore 
It is now shown t 


hat the tw 
(1.1) Let M be 


yt th {Y 
0 Cases ust 5 a ed occu ef 
the closed ga Just mentioned do inde. d 


Ement of the real line [0, 1] an 
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¥0y=iyx 
4¥+2y.  Obviou i 
2 ] . é 
e only reflexive elem sly, this operation is an allowable definition of 
als (1.2) Let M be [Ly x=0 and it is an extreme element. Le: 
ment is x= 0, but this +i ] and define 0 as before. Again the onl i 
1 (2) Let M be 0 A time it lies in the interior of M. FEL 
, 1] and define xoy=Ix+y). Here all elements of M are 
Can easily b 
and C1 Y be shown that, i 
Cs nO ae es bo of the above cases, Mis properly defined, 
্ি Vious d . 
f the main fixed emonstration and R10 fully justify the following formulatio 
th RI. Tf I RoE reflexivity theorem of this paper: i 
roughout, en EE the initial definition and C1, C4, C5 and C6 apply 
S exactly one st obs ce one of the following two cases occurs: (1) either M 
It should be fre na element or (2) all elements of M are reflexive. 
Y way upon C2, i embered that none of the results of this section depends in 
i 5a CHAE the commutativity of the operation 0. 
i S in Scotlon 2 ce of RI it will be recognized that the general formulation 
ee reflexive Is quite redundant. It suffices to say that there are two 
reflexive and ements in M, the only alternative being the coexistence of 
one non-reflexive element. 


4.1 
+ {NTRODUCT 
UCTION OF A NECESSARILY REFLEXIVE OPERATION 
of the operation 0, 


. ‘The 
1, nex স্ব 
es C2 রা ee চলক is a study of the commutativity 
that reflexive Wwithou J € argument will depend on the concatenation operation 
can b fs Zequiterment i restrictions of any kind. Yet is has been shown in RIl 
€ remedied i is not necessarily satisfied. This unsatisfactory situation 
e starti in the following way. 
UE isa set M and the assumptions C1, C4, C5 and C6 which 
b operation 0. On this basis a new operation is introduced, 


Dy.4 
* 2=%e0yis written iff 202=%0)- 
satisfies the assumptions C1 and 
ptions referring 


Iti 
0 ISn 
rg TL oor + that this new operation 
() © Operation rder to facilitate the distinction between assumptiont 
1 be tem: 2 and those involving the operation 6,2 distinguishing accent 
Yo C1, pe ly attached to the latter. * E 
+0 oy < any xeM and yeM with x <Y, say, then by ap { Ee 
Rene. Gn oy. According to R3 there is a unique element 2 wi 
erally ex; alogous result obtains if ¥>Y- Hence the unique clement 2=xe) 
03” exists. Q.E.D ” 
tae wx is trivially valid fbr any BEM OED. 
y et the elements x,y, © be given. Elements 2% 9? St ALE 
$s. Fee wow=UO0, 9° =20%, oft EE 
yA ensures the meaningfulness of these definitions. The following 
€ Written: 2=xe0), w=U0d q=2 0% r=%0 s=ye0 t=70S, 


ee 
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y ) (u [6) 0) 

=(¥eu)e(yev). By C4, (vxoy)o ain by 

tee FE ৰ es no ডুব = En 5) ন (sos) by EE ty defini- 

(x0u)o(y ov), t =(70s) 0(r0s), thus 20%=70s by C5. th En 

badly a 2: J=t by C5. Therefore, by OT 
সকল বং) ou)e(yev). This rs cor “Hagel? pos” ye. Then 
xe € EBs k ke 

4 ন Ul ন] EC iff Re, ie. HE x ত ন 
ERT G5 jniplies L232’ if x c)2y' oy, ES 

hse Xeyzxey', respectively. This establishes C5’. 0Q.E.D. 

0! Suppose that xea>b 


. lies the 

it is found that aex>b and aey<b ar. Q.E.D: 
ment w with aew=b. This যক dl 
(Note that C3’ is always valid regardless of whether C3 He! 2x0 2m 
As to C2; ivity: For xeM, yeM given, let 8 22%, ie i 
YO iff 20 2Zw%0%w and, by C5, a recisely Hoe 
¥ey2yesx, respectively. This means that o is commutative in p utative Case 
i i ich 0 is commutative. Further, in all CEE 

Served in going from oe to 0 and SrerTUOL , 
Sequence, studying the commutativity of the hs fact permi 
Y of the operation o. oted that t 1 
hroughout. It should also be te derivatio 
’ ‘-* Temain valid if e is substituted for © because in t 

ms commutativity was ever presupposed. 


is 
As a con 


pA 
3 erive 
It is the present aire TO ajiaiey © 
Senerality of @ and so of Gs TT 
6 ensures the validi 


torent 
i differ d 
Her B 0 
Sons it is assumed that there is a commutatiwe p je require 
elements u and le. uev=yeu. Also, without loss of generality, i 
that u<v. These assum 


Ptions will be used 
R12. Consider the formulatio 


050 
supP 
V=1(d) by assumption. Now 
=u1uev. According i 
- The uniqueness fo nl 
Y=’ by C5’, Q.E.D. » « petwee 
and assumes every ‘ value 0 
limits. and Ja 
Proof. By C5, = implies SUS ¢’ eu, lence y oyz=uey ple v2 
respectively. Further, it has been shown above that u and v are pos 


te 


pA 
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0, Now choose an element y withu<y <. Then, by C5’, ueu<ueoey< 
0-00; According to C6’, there is an element x with uey=x0u1 and 
t<y<z. QD. 
i a Let y’=i(v') and y” =7(x"). Then ‘(x e ")=7(%x') e +(x”) and T(x) 
Automorphism with respect to e. 
a" Proof. uw 0)'=v'eu and uey"=¥"ouimplies (uey')e(luey")=(* ou) 
Tie Fa C4 then yields (ueu)eod(y' ey")= (ae A) ° (u be 1) and, by od 
(2) = 0%x)ou. In other words, y’ ey =" en"), therefore (uw) 
aut =7(y ew’). ‘Taking R13 into account, it will be seen that +(x) is an 
morphism with respect to e. Q.E.D. 
Ne 0 the sequel, it is required to show that +(x) is the identity transformation, 
nl oe all elements of the domain of a(x) are fixed points of this transformation. 
at ready known that u and v are such fixed points. It is also clear by R14 
ৰ্‌ ny element which can be obtained by successively applying the operation 
° any finite number of fixed points is itself a fixed point. 
OW the tentative assumption is introduced that the element xo With wu < x0 <v 
a fixed point of T(x), i.e. xo#T(x0). Then an infinite sequence of elements 
Ain, ao Ha, or Te defined by x;=T(x0) and xny1=T(n). It is evident by 
Gi R13 that u < xn <vforalln. Further, this sequence is strictly ETE 
eli that x, <x. It follows from R13 that s(x) <7(a1), thus He: Eo 
Sy ne of reasoning gives generally xn <%Xny1- Analogously, 2X0 he jl 
in Sack by the same argument. Since these sequences are evident y bounde 
in মর event, they possess limits, x=lim xn, which will be either suprema‘or 
ecreny, Pending on whether the respective infinite sequence is IESETE ডং 
and in fn (detailed proofs concerning the existence of limits, লা ES 
of this Ma, are contained in Kristof (1967 a)). The assumption and defini 
Paragraph will be used in what follows. | | ed DOE 
Of the If x=lim Xn, Xi = T(X0), Xny1 = T(tn), Xo given, then xisa 2 EN 
Which tomorphism “(2). Further, x is the ‘largest ’ (‘smallest °) fixed p 
্ Smaller ’ (‘larger’) than wo, whichever applies. 
of. (i) In fuller detail 
UueXi=X00Uu, 


Is not 
খ 


UeX2=N)0U, 


Ue Xn;1=NXnel 


these nd the Paragraph preceding R15, the left sides and the right sides of 
must pe tations * have the limits ue x and xeu, respectively. ‘These a 
© equal because the corresponding sequences are pairwise identical. 

ji) Stet, lie X=7(%) as has been asserted. | 
! Suppose that x, < xo, for example. Thus x < x0. Let # bea fixed point 
and, since x and # 


1 
% 
X a র্‌ 
Sy. Then, by C5’, yeu<feu<Neu=uen 

2 Ny 2 [!) 
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line 
ডি y i C5, <8 <, The same ন 
Fee en i enerally 5 চহ ডে Ee ea টা h Thi EE vaber 
A ce Joint as assumed above cannot exist. An analog 
Rael by EAE argument for the 3 dr ene PO a 
eo Lea El two elements Ea er 0) 
OE toe Toy of (2). Then, by R14, Tr )er (x 
: () Lt ef rhence ডে) (9) =1"(9' ex"). Sx’ = (x0) an 
For xo as assumed before, an infinite sequence of elements x’; 
X'ny1= (xn) is defined. 
RIi6. Tf Xn,1 $ xn, then ny Sy, 


respectively, for all n. cn iff 
Proof. R13 implies that (2) 


£ x 
i i Xn41> 
is strictly increasing. 5 et y 4s), 
3 i.e. iff x05 x’, iff rao) TE 
SEO CI a = ট 
X15 xo, i.e. iff (x0) S xo iff + “{(x0) }S (x), ie. ao r holo. applies 
i i ie / op . 3 
ie. iff x’, S's, ie. iff x ২২% 1 Continuing this chain y 
1 t 
of 7! gives finally x n12X'n. QED. 


Its can be 
gd হয 

By analogy with the demonstration of R15 the following res 

obtained. 


KL TE fig EA 1 =7 (x0), 
fixed point of the automorphism (2). 
fixed point which is 

The proof of R1 


wis a 
a then ! 
Xny1=T (sn), wo given, 


ST est ) 
Further, x’ is the ‘ smallest : ী 
“larger ° (“smaller ’) than xo, whichever app 
7 parallels that of R15 completely.  Q.E.D. 


tO” 
: int of the au 
R18. The Previously introduced element x9 is a fixed point 0 
morphism 1(2), i.e. = 1(x0). 


Proof. Accordin 


ey AN 
. | রি L— lim Xn 
£ to R16 and RIT, xSx, if x Zw with 2 (2) 2 
Y'=lim xp. Further, 2=4(2) iff 7-1 2)=2, i.e. the automorphis 
7™(2) have the same 


pain 
xe t 
j Tre x)= 19) ox) 00, thus xox’ is also a fixeC tha 
if x and x’ Are. Tt may further b 


i 1 

€ inferred from zx’, by Lod i fy 
EAE OE Re le. JSveusy, respectively. 50 tatemen 
fixed point “between ° ¥ and x’ which contradicts the previous $ 
‘Therefore X0=1(%0). O.E.D. 


This result can b 
mutative, then 1 and 


08 
commutative for U<2<v. Analogously, TE Te 
are commutative. This latter Statement could be proved in the 54 un (x 
It would first be Shown that defines a unique function j 
and then the pattern of the previous Proofs would be followed. ta an! 

RIS. Suppose Uev=y ou with u<v. Consider any two elemen 
Y, With u<x<w and USY<v. Then x» and y are commutative. 


d disregioh 
s of Senerality it May be assumed that x <y pa ve 
ive a LS Us Ti follows.fiom fhe abox! uta! 
that u and y are commutative and further 

Q.E.D. 


om 
that then Yy and x are C 


e COM 
d ar 2 
e reformulated as follows: If 1 and v with wu < d 
2 are 


Voy=xev 


the trivial case x= y. 
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other words: If 1 and v are commutative, then these elements limit an 
EO that any two elements of this interval, including the limiting 
» Are Commutative. 

Ther Let V be the set of all elements v which satisfy uv e U=Uet,1 given. 
tively. Possesses a maximal element and a minimal element, v’ and ”, respec- 
soe Let % be an element with x>uw and suppose that by and 1 are not 
Suppose th Then, by R19, x’ and u cannot be commutative fz >. Now 
" i the set of all such elements x with x>1u1 has a smallest element, 
v satisfyin USUer'zys ou. On the other hand, there is certainly an element 
condition Vent with v>u1 and < " since =u evidently meets this 
হি Oh: a 1s not empty. ‘The possibility vue x">X*"euis now considered; 
ernative would be treated analogously. From C5’, uex'>%’e 
ও Then, by C6’, there exists an element w with uew=x" ou 
Vv. By C5’, it follows that uew>%weu in contradiction to the 
Consequently, the set of all x as introduced above has no 
Then the continuity of M requires V to have a maximal 
OES ‘The existence of a minimal element of V can be proved similarly. 


M80 00s a Lo Tf x and Yy are commutative, so also are xe a and yeaas well 

comm. fd ao y for every aeM. (2) If xeaand yeaoraex and aey are 
ative, so are x and y. 

b ge | Ney=yex yields, by C5’, (vey) e(aea)=(yex)elae a), hence 

The pro ed)e(yea) =(yea)e(xea). ‘This derivation can be reversed. 
Dy. of the other alternative mentioned in R21 is analogous. Q.E.D. 

Sleme; Let A be the set of all elements x with u<x <a. Let B be the set of 

tatiye As with b <y <u. Suppose that any two elements of A are commu- 


nN . 
AUB ন any two elements of B are commutative. ‘Then any two elements of 
commutative. 


Itis only necessary to show that ye =x oe yifu<x<aandb<y<u. 
MAES d, an infinite sequence y, yu Y2, --s Jn, --- is defined by y1=y e ¥ and 
Youn et Evidently, y<u<wx. This gives, with C3’ and C5’, y=yey< 
%; hence y <y; and y, <x. By an analogous argument, it follows 


k 
The 1 ex and ie 
$ eg nn Yk_iex=xevyr i. R-—1 steps of this sort lead to the result 
Q.E.D. 
then 23. 


I i . . . 
all pai f there is one commutative pair of elements in M, te v=v eu U<V, 


rs > 
Of elements are commutative. 
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inimal 
থি 2 Sr a minim 
Proof. (i) For u given, there is a maximal element, 0’, ey other 
element, ©", such that every pair of elements with members TE ie 0B 
burween ©" and u or between 1 and ov’, including the respective limits, 


নয lements 
mutative. This follows from R19 and R20. Therefore all pairs of e 


ive by 
i i imi re commutativ' 
with members between w” and 0’, including these limits, are co 
R22, 


(ii) It is known that u <’ because of u<v. 


S ents 

Consider an element r with w’ <7 Sxmax. By R21 and R19, all pairs of neg” 
with members lying between ver and ‘er are commutative, Ue ah 
included. For these limits: u=ueuc<u e7, thus u<uer; and v 
Further, by C5’, uer<v' or. 

(li) It is certai 
0 <r. For suppos 
uer'> 


"< Nmaxe 
Now suppose that v <1 


h 
1 , cen thoug 
nly possible to choose 7 such that uer<v', eve 


- then 
a = an 7, 
e that an element 7’ is chosen instead of such « 
V=v' ev’ >vou 


rity 0 
ativity 
=4e0', by C3’, C5’, v'>u and the EEN Pe nent if 
u and v'. So uer'>v'>uev. According to C6’, there is then an 
with v'=u0r and r'>7r>%', 


S 
ye sf alway 
Hence the condition uer<v' can # 


ol 
llasuer and vor border ‘intervals ° [u, '] eer 
ch that every Pair of elements with both mem for the 

i is commutative. ‘The same le ve] 
Proper or improper sub-interval of ie ent, an 
[v', ve 7] share the element v' and are ACI egies 
ollows that all pairs of elements with members d 


g 0 as sta 
mutative. Butv'<v'er, go the 


Vmax. Ther 
Y an analogou 


are 


nt 
্‌ ‘ve eleme 
or (2) 1S no pair of commutative be 
Aas Any two celeme ন 
remembered that this Ea of 
A 


€ closed fine 
tx+1y. There is no EOrrtiUta of the Teal line [0, 1] and de 
(2) Define M as Pair 0 


1 
commutative. 


ents দ্‌ 


ativity theorem 
R24. Tf Msati initial a 


apply, then Precisely one of the follo 
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ie of different commutative elements or (2) every pair of elements of M is 
mutative. 
oe ! now be demonstrated that RI1 and R24, the main results, are inde- 
~~ A in the sense that each combination of the cases allowed by these theorems 
or TE Let M be the closed segment of the real line [0, 1]. The following 
old table is self-explanatory. 


Ton nn BR (Hf na 


| Commutativity 
| —— ———_—~ ন 
Reflexivity Always In no instance 
\ y 
Always x0y=Ix+14y x0y=4x+13y 
In just one ¥*0y=#x*+1y Xx0y=#x+3y 


instance 


A §' . 

Siven i 2 consequence of R24 it will be seen that the general formulation of C3 

mutativ Section 2 is quite redundant. It suffices to say that there is one com- 
© pair of different elements in M, the only alternative being the absence 


any s i 
. Y such commutative pair. 


6. SoME FURTHER RESULTS ON COMMUTATIVITY 


They ine boo ois some additional results concerning commutativity are given. 
order ty © Viewed as necessary conditions which commutativity must obey in 
that th. Consistent with the definition of M, C1 and C4 to C6. Itis believed 
5 it Conditions may be empirically checked for a set of data. 
tis nos Crivations will again be made in terms of the operation e. However, 
)- Substit, that the results on commutativity remain valid when 0 is eventually 
oe ent ed for o. Only the case permitted by R24 in which no two different 
- ne Are actually commutative will be considered. 
Owing theorem is first proved: 
bein Pairy Suppose thataex<xeaandaez>zea;all three elements a, x, & 
i Se different. Then a necessarily lies ‘ between ’ x and 2. 
trea, 5 (i) Tt is assumed that x»<z, since the other alternative may be 
(ise Can larly. Further, let a<x, contrary to the assertion (again, the case 
ons € treated analogously). Hencea<x<z. By C5’, the initial assump- 
is > ere is Combined to give aex<xea<zea<aer. Then, according to 
a Ane em an element r such that aer=ve0eaandx<r< &- Likewise, there 
nd so a ent twithaeti=zeaandx<t<z. By C5'x<zimpliesxea<zea 
an (ii) Eve @et, therefore r<t. Thus x<r<t<s. | ge 
fro ment an rE Y¥ together with the relation AEFI determines be 
™m Setting ie 7<s<t when aes=ye ais required. ‘The UnIENESS b 1 
©s=ae0s, which yields s=s' by C5. As to existence, s=7 for 


MN 


| 
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H = srefore it is t 
ished in (i). There Ea 
= =2 has been already establishe Ho, 
sp Ms oe Then, by C5’ and (i), it follows a নাল” and | 
NE ess re Hence, by C6’, there is an element s with ae 
yea <Zea= . 
< ed by 
ir? (iii) An infinite sequence of elements, sj, 52, ..., Sn, ..., iS de 
de7r=xe0a, 


aesi=70a, 


\ 
des2=5) 0a, 

Gesn;i=snea, 
‘This definition is 


. to 

a ording , 
meaningful. Namely, all elements s, do exist acc 

(i) and (ii). Further, by (ii) 


<? implies T<sI<sI<. 


is strictly increasing and b 
T<5<t. 


2 
ition, by 
» generally 7 <sn <t for all 1. In দলত লা on 
‘SS <.... Therefore the sequence 0 nan 8 
ounded by t. Thus it possesses a supre 


R26. (1) Suppose that a 


a, x, 2 being different. Then 


ts 
lemen™, 
eN<Yea and aez<zea, all three ¢ ff 
than a. (2) 


a 
H ‘ ater ’ or ‘sm 
both » and z are either ‘ grea 


oO treated, the proof of (2) being SILI 5 contrat) 
(i) Itis Assumed without loss of Benerality that x <2 and x <a 


‘+ies Y 
‘ inequalitie®, 
Combinin, ‘Orresponding sides of the above ‘ineq te 
by the Operation @ and taking C3’ 


Ver, 
i ©. of 
5 C4" and C5’ intoraccount BIVES EY iotld 
WS 


ntra 
+ Otherwise, namely, the co i 
dea<aeawould result, ive pold 
il) Thus cither x62 <a o rst alternative. py 
TrYer>a. Suppose the 
(the second case would 


=ze 
be treated “nalogously). Then xez<a<2= of 
(i) and C3’. Tt follows by C6’ that there is an element 2 with =) 3B 
¥<y<z2. However, Y=a is i 


=, 

2 1s impossible ec herwise a=ae a= ordis 
C3’ and so a=2 by C5 i টী, ion & (0) Thus ye Then, লং RY 
to (i), either x and y or x and y lie on the Same side of a. It follows it iS i { 
and the initial “inequalities that in each instance ae y<yea. But! tiie 
known thatae 2 <zea. i 


i “inequalit! a 
i Combining Corresponding sides of these ‘in 2 
by the operation @ and applying C3, C4 
which reducesto ae a<a 


CRA 
RE and C5’ yields a e (ye 2) <p J 
9dinviewofa= Yez. ‘This contradiction 
R26. 0Q.E.D. 


EES EI 
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RT. 1 
(1) If a<x and aex<xea, then b<z implies bez<zeb and 


Vice v 
ৰ ersa, (2) If 
Vice versa. ) a<x and aex>xea, then b<z implies bez>zeb and 


Proof. i 
() io he (0 will be proved, the proof of (2) being similar. 
Then, by R25 to’ hience b<a<x, i.e. both b and a are on the same side of x. 
Same side of b ogether with R26, bex<xeb. Since x and 2 are both on the 
FG re en aDalogously boise 
Sign = Can and suppose contrary to the assertion that be 
Same side of 2 apply in view of R24), so ze b<bexz. Since a and b lie on the 
¥ and 2 lie Fl ee by R25 and R26, 2ea<aez. Thusaez>#2eda. But 
Contradictio € same side of a, which again leads to the resultaex>xeain 
(ii) Vice to the assumption. 
Patible only STE it is clear by the same arguments that be2>zebis com- 
he 2. Q.E.D. 
let WM ST excluded by R25 and R26 do indeed occur. For example, 
fl: Consider Ry lie; the closed segment of the real line bordered by —1 and 
at this definiti 1€ Operation uo v= Ju+ v0, ueM, 0 eM. It can easily be seen 
oweon jer is consistent with the initial axioms. 
CEI 3 oo erR25. Puta=0,x=-l2=l. Thusaox= —{, 
Y Consider Roe Clearly, aqox<xoa, a02>%0%, V<a<হ. 
0a=3. Th 6. Take a=0, x=, 2=1. Thenaox=# ¥0a=% 
Ue to Ros AOX>X0a, AOZPEOA, A<X<E. 
7, the actual choice of a and x is immateria 


telat; 

ion 

fh. X0f aa ; - ডি 

Udied nd x is preserved if relations of the sort ao x 


2>ze0b (the 


x0a=-t, 
ao2={ু, 


1 as long as the order 
ৰ ইwx0a are to be 
s made of the fact that the results remain 
if 0 is substituted for e- 

d in the following self-explanatory 


In th 
Valig, ss OE examples, use wa 
to he el commutativity is concerned, 
Urfo]d table ts of this section are summarize. 


Inequality signs in a0ox$£%0a 
and aO2$£ 20a are 


Inequality signs in 


< 

2S ¥ and aS 2 are Same Different 
Same Possible Impossible 
Different Impossible Possible 


7. 

INTRODUCTION OF A NECESSARILY CoMMUTATIVE OPERATION 
validity of C2 and C3. How- 
btained by defining ee. This 
ivity which led to the 


6, he 0 ig 4 ত 
er, an o er 0 did not require the general 
ten lo ration which was also reflexive was 0 
Sults of tl abled a study to be made of the commutat 
€ last two sections. 
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vides 
i intended to introduce an operation which  ” il 
ht is Operation then satisfies all assumptions C1 I Tu arto eh 
belShouniorer that it is possible to assign on this basis numerical va 
Re of M such that an isomorphic map of M results. 


‘The following theorem is first considered: 


f tWO 
3 elements © 
R28. Let un and vp, respectively, denote the general eler 


ক nd ©, 
vith limits 1 a 
infinite and strictly monotone Sequences of elements of M with 
respectively. Assume further 


element une vn is strictly mono 


Proof. (i) Consider fi 


increasing (the case of 


© Un+1 
=D FJ In 41 
<n; for all 1. Hence, by C5’, une Eth the 
without invoking an additi ‘Therefore the Se for all 1; 
্‌ ৰ gj 

S:stttctly inoressing,. Then va and oy 


and w<uev, 
ation given to 


Vk 

9 

CS Uk 

strictly monotone SEM limits © 
wi 

tious, ue Uk, ... and vie Oks Uke VE, Ure VL, o, -.. Supe ORY 

and Uiev, by definition and by RS. Since, by G5, Ukyy © Vk41 

Ure Uo >i; e UF, 2, 


and so on, it foll 
4 (ur 0 0)=u ev by RO. Therefore ases 
> 0 incre 
(li) Now Consider the case in Which one of the initial SEqvenest, rth 
and the other decreases, i.e n;1<Un for all n. It be treat 
10, for all 2 (the other case would an) exists. 
Since, by C5’, uy eWmn<uev, for all nm, w=suplun e upltn od) 
| aev<un evn, SUevy, for all n which yields ued 
SSUp(in 5 6 on) <inf(u si ), ie. by Ro, Uov<w<uev, thus 
Q.E.D. 


€ 
enc 
k. H 
OWs that w> uj ev for eet 
WE<uevandw>uev, hen 


n), le. b 


Next, a distinction is made 


ive 
si 
exclu 
between two exhaustive and mutually 
classes of sets M for Which com, ivi 


th 

equen 
Mutativity of the Operation e (and cons 
Senerally hold. 


7 id to be Of class 
relation x e)<y ex holds. M shall b 
% <y, the relation x» Y>Yyexvis satisfie 

It follows from R27 that this defi 
The class membersh 


elements of it. 


j 219 
Sets Admitting a Concatenation Operation 


Class 1. uppose x and Y are dilterer t elements of 1 : 
H £f. 6. Let M be of J I 5S p diff 1 VM 
LE, x> 


Then x0 ) 0 infinite 
i i ef. 5. Two! 
bon en xe yzyew, respectively, a pore be Mies 
n fined recursively b 
Sequences with elements xn and yn are defined recur y 


1 =Y ex 
X= 0), and Y1i=Y 
12 = Y1 ® Xy, 
Hite Ju J2=1 
'n;1 = Yn Xn, 
ni) = Nn © Yn, Jn ) 


¥ Then b a! F = E; FE: ) ) = Y, thus 
XY, lk 2 y C3’ and C5, «x Nex<xey<ye)) 


f ass I), <x) <1 <. 
+ Similarly, x <yi<y and by 8 দক , i It follows 
Further =X ov, <x, 0 y=, hence vi <xs. Simi hence V2 < Ys: 
ie oa Xie) (2 @ 11 i 
lian DY the definition of class I ন ্ a EES ans tn Sas. ANd. 
line of reasoning shows that HoT ERE Hence eseto xn and J =inf yn 
Le ‘‘: SY2<Y) <Yy with xn <yn for all n. b 
exist, Y2<Yyi<y n ~) 


€ same 


air of elements x,y. This 
it is associated with the (unordered) pair of hoes ee Re 
ett is indicated simply as x Ay. It is obviously unig 
and y and lies between x and y. | NMTPAIETEE 
In thi ES formally inserted in the above ‘ equations ’, 
ts “চলক পক i ' his operation 
Satis, San operation A has been defined. It will be shown a» Ee LS 
eres ie Assumptions C1 to C6. For formal reasons, the 
| : i ion A. 
Ln to as CI" to C6” if they involve the derived operatio 
c follows at once that 


Vl 


i i lement 
e uniquely an e 
ত Cl, Any two elements x and y of M determin 
Pay of yy 
C2" 
Ay= a 
C3" HY Aw 


i ki0ms) is not so immediate. 
ificati ini ns (axioms) is no 0 
The ro Verification of the remaining assumptio f Ts Stell thence Te 
Sh Le owing derivations refer to sets M of class I un! Ww 
pfu to 
onside Fe | J UAV)=(xeu)A 
29. Onsider four elements, x, YU, 0. hen (x Ay) e ( ) 
La) 0). 
‘oof. Ut Ww=yeuand 2=y0vand define 
W=Wwer, 21=20%, 


= Ww. 
Wi=Wi eT, 32= 2) @ Wi, 


' = 2p ®@ Wh, 
Wn; 1 = Wp © 2p, 2ny1 fl n 
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By Det. 6, lim w, =lim 21=%w A 2=(¥ eu) A(y ev). But the above ‘ equations 
may be rewritten as follows, using C4’. 


wi =( ev) e (yd) 21=(y 00) e (x0) 
=(x ey) e (ue) =(y ex) e (vou) 
=X) eu. =Yi ev. 

Wo=(X, ui) 0 (y; 0 ) 22=(y,00,) e (x, 6 ui) 
=(4, ey) 0 (uj ov) =(y,ex;) eo (v, oui) 
=X 0 U,. =y2 0 0s. 


Generally, for all n, 


21 = Yn © Un. ion 
In these reformulations the definitions of Xn, Un, Yn and vn, follow the not * 
used in Def. 6 and are therefore not believed to require further explanation 1 
Now it follows from R28 that lim wn =(lim Xn) 0 (lim un), i.e. remembering 
Def. 6, (x61) a (y e0)=(% sy) 0 (u av) as has been asserted. Q.E.D. 

Now the following is Proved: 


TE STEALS Ff 
Bein Ca iff x AYSx Ay for ally. Also, YSyifsnsy xn) for all 
() We RR to prove the first Statement in view of C2". 


Wn = Xp © Up, 


Y'1=x' ey 1 , 
ক $ Y1=y0e%x, 
CATE 
2=%1097)';, Y'2=Y',0 x’, 
Nn = 0 if 
AY 1 'Y 
s: ney n Ynj1=Ynexn 


Correspondingly, 
%1=2 ey, Y= 0x, 
%2=%, eV, Y2 =Y; e %y 
(at 9), In =YJn © Xn, 
By Def. 6 ¥' Ay=lim x’ X ESS 
0 and x Avy=1; চ! Ht os ai ABE 
and Ins Yn, respectively. It de fe Using C3’, x 5% peo, Vn 
i iy at x=yxi i / = 8 ie 
; (ii) Consider the IE NEE, TE Slow ও শু" <lim Xm 
¥ AY<Xx AY. Now Suppose that S only that lim xn < i 


{s 
Se nt 
X<Y <. We obtain a a EX < =XAYy. Let x” be any eleme!, Ay 
XAY. PSA y= Ay, therefore x Ay= 

(iii) Choose an element TED 
of elements x's oy, oy ox. d ye loa LE 

i; fe ys 
possible to arrange that x” 1 And generally x"n,,=x" ex nn: cas 


#Y (just t i $ ড thi 
separately). From C5 it idols LG the necessity to discuss an’ 
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y Cie by 0 So it has a limit. RY gives lim x"n=%" elim xn. C3’ and 
yield "=lim xn. Therefore, by taking a finite index k large enough, it is 

l Possible to have x'<x"i<x. Then, by (ii), x": AY=5 AY. 

| iv) Remembering the definitions in (iii), R29 and C3”, 

XL AY=(" ey) AY=(X" Ay) ev, 

say =(x" ox) y=" sy) e (xs 3), 

Ns LY=(N" 0x2) Y=" sy) 0 la"2 23), 


a xray ex) y= apes. 
then UE EE ¥ by (iii) and x" sy=% ay by (ii). The last K equation yields 
before া ! and C5, XAY=X" 1A). Introducing this into the equation 
analogou g last, it is found that x Ay=X"r 22). After a finite number of 
in Def S backward steps the result x A y= is obtained. However, it was seen 
View of 1 X Ay must lie between x and y if x#y (the case #18 trivial in 
0») ST requirement and C3”). This contradiction proves ¥ AY <» Ay. 
Sy im yy it may be shown that x'>x implies x Ay> NAY. But if 
tions ae ies x’ A YE x Ay, respectively, then the corresponding reverse implica- 
C6" Obviously also correct. Q.5.D. . . 
§ With 2 If, for a and b given, x A a <b <y 2 a, then there is a unique element 
deme Aa=b. Similarly, aau<b<ansv implies the existence of a unique 
Le WwW yithanw=b. 
Le In view of C2”, it suffices to prove only the first part. 
of all e et A be the set of all elements x satisfying x Aa <b and let B be the set 
*' SETS Y satisfying yAa>b. Then, by C5", x’ <x and xed implies 
empty pSimilarly, y'>y and yeB implies y'eB. The sets A and B are not 
The dq assumption. ~ Suppose that an element with # A a=b does not exist. 
CUR ue to the continuity of M, A has either a ‘ largest » element and B has 
the re SS ’ or A has no ‘ largest ’ element and B has a ‘smallest’. Suppose 
large LC Case holds (the second would be treated similarly). Let x" be the 
clement of A, i.e. x" Aa<b and y 5 a>b for all y withy>a. 
An infinite sequence of elements is defined by x") “” 0), 2=X PET 
ad e%x’n,.... C5’ shows that this sequence is strictly decreasing and 
tel yr ~ by %", so it has a limit. RY 3 X’n=X"e lim a C3 and ig 
Ya) a lim x". Since xn > Nhgi2all n, # follows that v"neB and 0 xn 2 a> 


gw V) Nsider the infinite sequence of elements x"nda. From R29 and 


> 


(ii 


i Aa=(x" ey) a=" a) e (y 2 a), 
X"2ha=(x"ex"i)Aa =(x" na) e (x, a) 
x0 a=(x" ex") Aa=(x"ia)e (x2 A a), 


Xana =(x"ex'n)sa =(x" 5a) e (xn Aa), 


5.P. 
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ডী “ and boun- 
i i ing by (ii) and C5 fir ). 
f elements x"n A ais strictly decreasing | 0 SE 
(দশ hy Thus it has a line RY yields lim(xn 2 a) =(%x" Aa) oe lim( xn 
By C3" and C5" it is inferred that x” na =lim(x", 4 a). 


Cs" 
i af LG b. 
(iv) It follows that there is an index k such that x” 5a i ৰ Spption of 
ields x" <x"k, further x"peA. This contradicts the maximality ted property. 
Sn therefore proves the existence of an element 2 with the asse 1 implies 
Ed =I A 
(V) The uniqueness of 2 follows from the fact that 2a 
)- un হট 
2=2' by C5". o0.E.D. a OB 
R30. Let un and vy, Tespectively, denote the general fe and 
infinite and strictly monotone Sequences of elements of M with cl element 
respectively. Assume further that the infinite sequence with the rn) 
Un £ Vn is strictly monotone. Then lim(up A vn) =(lim un) (lim oe of R28, 
‘This theorem can be Proved in complete analogy with the Pp 


= k QED. 
© being replaced by 4A. The Proof does not involve bisymmetry 
C4". Consider four elem 


=(x251)2 
ents, x,y, u,v. Then (x Ay) A (wu a 0)=(s 
(y a0). 


Proof. Set W=yY Auand z 


C5. 
BE p uenc 
=Y Av and define the following infinite seq 
Ui=wez, 2,=20w, 
V2=w, 0 2), 22=2) © wi, 


Wn; =w, © 2p, ny 1=2p © Wh, 


By Def. 6, lim wn =lim 2n=w z= 


l «equa 
i (au) (ys 0). Using R29, these 
tions? may be rewritten as 
=(9 1) o (y sv) =(y Av) oe (x Au) 
(ey) ave) =(y ex) A(v eu) 
=%, A uj. =Yy; a 0. 
P= (a Au) o (y, sv) 2=(Y, Av) o (x, aus) 
ie ey) us 2) =(y, e Xi) Alvi eu) 
=X Aus. Hl | 
Generally, for alln, YA 0s | 
FD) 
Wot Ay, EE ক  & 
In these reformulations 1 =Yn A vp 


0. 
the definitions of w the notat! 
employed in Def. 6 and Are therefore not bell Ed iy fe explanati? 
Now it follows from R30 that lim wo = ws a u) a (y a v)=(lim 2) * 
(lim un) =(x Ay) ns (u Av) as has been asserted. os D 41) A(y 

The results Obtained so far a A 
and Def. 6). For the al 


5 
Def. 
PPly when Mis of class T (remember 
definitions are given: 


wing 
ternative of yf belonging to class I the follo 
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ee Def. 6’. If Mis of class II then the operation » is defined by x a4 y=lim xn, 
re 


X1=Y 6X, i=), 
¥2=Y1 6 01, V2=N) 6 )v 
Mny1=Yn © Nn, Jny1= Xn © Yn, 


that llowing the patterns of the previous derivations it may be readily shown 
Og0us results obtain from Def. 6’. 
members; may be said that in general the operation A y regardless of tHe class 
ting the 1p of M, fulfils all assumptions (axioms) C1” to C6”, therefore exhibi- 
© definitio Same features as did operation ». It should also be noted that the 
requir Nn A was based on the introduction of eo, the latter only lacking the 
ment of commutativity. 


S 8. NUMERICAL REPRESENTATIONS 
ae Of the previous results will now be used in solving the scaling problem 
Values ES the question of existence and uniqueness of an assignment of numerical 
attentio, the elements of M such that an isometric map of M obtains. HONE 
onve ‘1 Will be restricted to linear representations because of their numerica 

lence. 
“ ধা M will be said to be linearly scalable with respect to order anda 
nation operation o if, for every 2M, a real number s(2) can be found 
Lo (c) i tab is equivalent to s(a)2s(b), respectively, and c=a 0b is equivalent 
TE) + B.s(b) +y, where «, B and +y are real constants. 
95 as been shown in a previous article (Kristof, 1967 b; also Pfanzagl, 
Th e that a linear scale of M can be obtained when the operation » is involved. 
*Pression for s(x#y) was required to be 

(xy) = 3 {s() + (9) } (1) 
rE Eo correspond with one’s notion of taking the ‘middle’ between two 
on * and y. It has also been shown that any such scaling function s(x) 
Drees SSatily Continuous and that the group of admissible transformations, 
D8 the form of (1), is the group of all positive linear transformations. 
TES 1S Obvious, of course, that eqn. (1) constitutes the only possible linear 
yo for (xy) in terms of s(x) and s(Y)- Namely, setting s(x#y) = a-s() 
bit,” leads to («+ B-—1)s(x)+y=0 by means of reflexivity. Due to the 
< Et Of s(x), the left side must vanish identically, which gives u+B=1 
| ommutativity then requires (« -B) {s(x) -s( Y)}= ৰ = ne 0 
Thi (9) 2 (3). Therefore a= B=2, which establishes the orm of eqn. (1). 
Satisf, © result applies equally when the operation A 1s involved since A 
- assumptions C1” to C6” and therefore is formally indistinguishable 
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ঠি ; ; inuous and 
from an operation #. Hence there is an order-preserving, continuo 
essentially unique scale s(x) such that 


2 

(x ay) =3 (s(x) + s() }. | ls 

‘This is the only possible linear form. This result is used to establish a ট oF 

when the operation e is involved. It only remains to secure an ES 
s(x ey) in terms of s(x) and s(y) on the basis of the above results. Su 


$ To 
expression must exist, since s(x 0 2) is wholly determined by s(x) and s(y)- 
this end, R29 is invoked. 


(3) 
(x 59) 0 (u A 0)=(x 01) A (y 00). and 
The continuity and monotonicity of s(z) ensures the existence, continuity 
monotonicity of the inverse function s 


ide 0 
“(.). Applying eqn. (2), the left Bice 
eqn. (3) can be written as 


শ[গ্ [ dl 


Similarly, the right side of eqn. (3) becomes 


2 (s()) e 571 (su) J] + sls) } o = (slo) 2 
E 
These somewhat unwield: 


Y expressions m: 
(y) =, S(u)=p, sv) =. Further, 
function s(.). This gives 


s-1 £+ 1 f+ S{S7HE) o s-1 i 0} 

[5 )es t)]- (6) ter 0, 
Now define S(g, M)=s{sU(E) os 
continuity of s(.), $7(.), the op 


i =&, 
ay be simplified by setting 0 he 
both sides are taken as arguments © 


টী ্‌ the 
() }. This function is continuous due to 
eration e (C6) and M. Thus 


ft nin) S(G, gS 4) 
(5, Et) = E+ Sn.) ) 


This i ’ 5 0). 
Te oo sen functional equation for two variables (Aczél, 1961, P- ঢং 
Y continuous Solutions are of the form 6) 
A S(£ n)= a€é + 
With real constant ৰ Bn+y . WS 
that ss, 8, y, Considering the previous substitutions, it follo 
Monotonicity implies ey) = s(x) + By) +. | 
and y=0 with a=0) ee Be 2. B>0. Reflexivity Obviously requires fe a ing 
linear expression is Obtained excluded. With these provisions, the fo 2 


(xe J)= 6) 
Thus s(x  y) is the quasilin s(x) + Bs). 


00 
Car mean of s(x; iy ing derivat! 

followed a pattern set b A (a) and $(y). ‘The foregoing | et) 
equation. Y Tez (1961, P. 199) in his study of the bisym™ 


ED 
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The numeric i 
gl al value of a in e 6) 1 i i 
ws qn. (6) is determined by any tri le x,y and 
ey when xy. Namely, eqn. (6) yields রয় চহা 
(xy) s(y) 
X= ——_ 
Ee 0) 0 
“ 5 i 
15 the Je seen that eqns. 2) and (6) satisfy eqn. (3). Thus eqn. (6) 
and (3) ex expression indeed, and it is the only solution based on eqns. (2) 
fansformate Es for positive linear transformations of s(x). However, such 
form of e ত evidently do not affect x, which follows immediately from the 
Non ES ). Thus a linear scale of M exists with respect to the operation e . 
bare enh required to show that all scales linear with respect to the operation 
en this m y the manifold of scales s(x). To this end, let (x) be such a scale. 
means, in ust be a monotone and continuous real function of x. Linearity 
analogy to similar considerations above, 
With oy (x ey) =o (x) + BUY) 
function le «#0, 1, a>0, B>0, « a certain real constant. 
Usi (.) exists and it is monotone and continuous. 
ng eqn. (3), 
tl a 
Uo ons hop SHR PALABT Bn) } 5 t {a() + BO) }- 
, 1) = (y)=, tu) =, (0) =v and introducing the continuous function 
(€) A (1) }= (x ay), leads to the result 


(8) 


The inverse 


LT a. T(E, 1) +B: TU, 7) = T (aE + Bus on + Br). 0) 

that ios a to solve this functional equation in two variables. Put =”, 0 
v=. Keeping 7 constant and then setting f(6)= T(E, 1), gives 

10) 


Accorqin of(€) + Bf(n) =f(at + Bu). 
function § to Aczél (1961, p. 69), the general continuous so 
al equation with «a+ B=1 is 


f(E)=AE+D 


lution of this latter 


(11) 


With 
Ho constants A and D. 
Wwe 
ever, when % is not held fixed, then 


(1) and Dy) thus A and D may be functions of T, 


T(E, n)= An)" + D(). (12) 


Usi 

In 

ue fe (12), eqn. (9) can be brought into the form 

wit BD) - Deen Bo) =a Aan Br) ACD Bell 40), 

Squation 7, and v are allowed to vary independently. Since the left side of this 
is evidently independent of £ and pt, so also is the right side. Therefore 

and c Aan + Br) — A(n)= Alan + BP) — A) =0, (33) 

onsequently 
aD(n) + BDO) = Dan + Bv). (14) 


6 Walter Kristof 
22. 


f , remembering 
It follows from eqn. (13) that A(.)is simply a constant, 4. Then 


i Sd 1961, . 69), 
(11), D(n) must be a continuous function and, again by Aczél ( Ls 
eqn. 2 


(15) 
D(n)=By+C 
with real constants B and C. So finally, from eqn. (12), (16) 
T(E, 1)=A4E+ Bn + C. 
Considering the previous substitutions, this becomes iD) 
(x ay)= 4t(s) + Bi(y) + C. 


It is immediately found that eqn. 


(16) solves eqn. (9). nbs sl 
Reflexivity and commutativity of the operation a» require A= 
C=0. Thus 


(18) 
(os Ay) =3 {0) +1) }. 


s itself 
embership of sets M as introduced in Def. 5 ET es I an 
«in eqn. (8). Itis easily seen that a> 43 if Miso biect to any 
«<? if Mis of class II. is a fixed constant and DOE articular 
transformations as is (x), depend upon the choice of a pz 

(x). 


At any rate, a 
i.e. x does not 


ss 
iquene 
‘The existence and unid! 
With respect to this Operation 


to the 
will be examined. ত 40) 
X0Y) in terms of (x) a 


(1 9) 
¥oy=(x0y)o0 (x °). 
From eqn. (8), 
(soy) = tt ou) 4 Buy) } o 1-1 {ot(s) + Br(y) J. as 
Hence (x0 Y)is a function solely of u(x) + BU(y), which is written simply (20) 
(x0y)=g {at(s) + B(y)}, 
with the unknow 


n function £(.). 


because #(.) and the Operation 


continuous. Th 
continuous. In 


:nuyous 
uo 
hi , nd contin is 
1s function is monotone a: 
O are mo: 


and 
function 
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Taking b i 
8 both sides as arguments of the functions 1(.) and g=(.) and using eqn. 


(20) finally gives 

This funecionel og {t(x) } + Be {1(9) }=8 Lot( a) + BU) }. (22) 
ls equati i 

Continuous golition is ন kr te” Ld athe E20 aii 


Eand H bei 80) }= En) +H, 
Cing real constants. Using eqn. (20), it follows that 
Where F (x0y)=Ft(n)+ GUY) +H, 
(v0) di and G=EB. However, eqns. (23) and (8) must together satisfy 
e))o(xey)} according to eqn. (19). This leads to 
thus F (Fa+ Ga F)t(a) +(FB+ GB — G)HY)=0, 
u+Ga— 
his lead Gu—F=FB+GB-—G=0 because of the arbitariness of (x) and ty). 
S to the constraint 


(23) 


F ua a 
EE = EE Gt 24 
Anoth G 8B l=n &%) 
er EAE 
With x Constraint is imposed by eqn. (23) itself when any triple x,y and x0 
Finally, there 


Sa NE fe chosen so that (x), #(y) and t(x 0 3) are given values. 
as required ER due to the existence of precisely one reflexive element, 2 Say, 
ements of y R11 (no consideration is given here to the case in which all of the 

M are reflexive because this has been covered by eqn. (8) involving 


e operati 
ratio. 3 ম 
nm @). All three constraints are formulated as the equations: 


F(1-)-Ga=0, 
F(x) + GU(y) + H= (x0), 
Fi(2)+GU(2)+H= (2), 
ossible to achieve. This system of 
knowns F, G, H possesses precisely 


anish: 


(25) 


arr, a 

ipy Bin 

iahomoge or Xx 0y#2, which is of course p 

2 solution Ee linear equations with the un 
ecause the determinant of the matrix of coefficients does not V' 


Iss: se 0 
(x) 1) 1] =o) +(01- ty) H(s)=tlne ») 1). 
Bu ট্ (2) (2) : { 
tecalline” / #2 implies t(xey)—#(2)#0. For, suppose that xey=2. Then, 
tontrag; An (19), x oy=(xey)o(xey)=#0 2=2, ie. %0J=2. This 
tonicity of {(.), (x e y)- (2) #0. 


dicti 
iction proves that x e y #2 and, by mono 
Jution F, G, H indeed. In other WOrds, 


he choice of a particular A.) 
with respect to the operation © 


Th 
Se 
these টল (25) possesses precisely one so 
h NE are wholly determined by t 
AS bee the existence of a scale of M linear 
1 established. 
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f ন! 
5 EE the uni £ are give 
ideration is given to L ns 
a" ED hon that all scales linear with respect to the op 
required to 


1 f scales s(x). 
iv: he manifold 0 aad 
3 (x) and so, equivalently, by t noone 
YS mi eT = a scale. Then this must be a mo 
ah sen real function of x. Linearity means 


(26) 
(0y)=Pr(a) + Gry) + H 


y a certain con- 
R me real constants F, G, H. These constants must ele which 
Es Male from eqn. (19). Namely, r(x 0Y)=7r{(xe))o( 
becomes, upon applying eqn. (26), 


(27) 


r(x e))= Fro) + 


Setting a= F/(F+ G) and B=G/ 
«>0, B>0. Therefore (x) is 


1). 
F+G™ 


1 
0, 
(F+G) gives a+ B=1 and, of ao a af 
2 particular case contained in the 


ent 


0 positive linear 


7 Were chosen). Consequently, eqn. (26) 


"(0)+H 


When T()=M(.)+u, A>0, A and « real constants. 


t 
It is easily found tha 
HG fu 
woy)= Fra) + GH) + EE tpn k 
Equating this with eqn. (26) gives 


(28) 

=F; @=G; TEA (FG De idently’ 
Hence F and G are not affected by admissible transformations of 7). TD iculah 
by a suitable Choice of Aand , H’ will assume any Prescribed value. Inp 
it can be made zero. Hl will be 

This Paper establishes a formal theory. Special applications bor 
considered in another article. F 
physical conti 


SY 
OF example, the derivation of scales ন enters 
Tua may inyojye sn CPeration e when a time-order ef 

€ Comparisons of stimuli, 

€ responses in e 


. uli 
| Xperiments whi 
“halving ? Perceptual interval 


‘ent 
ES S, then, perha j f 
well describe the Situation. Tt has been sh. 
not necessarily preclude scala 


bility of a domain. 


i it is 
i ular, 1t 1S 
queness question. In partic 2 
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A HY COSNT 
N EVALUATION OF RASCH’S STRUCTURAL MODEL FOR 
TEST ITEMS 
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and A. V. EVERETT 


University of Queensland 


nel item analysis model for intelligence tests was examined using a 
874 ee omnibus intelligence test, administered to two samples of 608 and 
provid Jects respectively. Five separate hypotheses were tested. Findings 

e evidence for Rasch’s claim that the difficulty level of items and ability 


level ind; £, 
Vel indices are independent of the sample on which they are based. 


K 1. INTRODUCTION 
T Rasch (1960) proposed three probabilistic models for use in test construction. 
node) + 1s concerned with accuracy in reading tests or, as Rasch calls it, a 
Specifically fon a n8 ’; the second is designed for speeded tests, or more 
items, 1 Or reading speeds; and the third for item analysis of intelligence test 
Th tis this third of Rasch’s models with which this paper is concerned. 
fen SOE feature of Rasch’s probabilistic models is that he abolishes 
ndividual als Which limits any findings to a particular sample or population. 
Of the diffe ility estimates are derived which, Rasch claims, are independent 
estimates culty of particular test items or ofa particular test. Item difficulty 
Particy a are also derived, which are likewise claimed to be independent of the 
distrib group of subjects involved. No recourse 1s made to the normal 
EaBUEeT Or to any other distribution. Rasch concludes that ‘ the concepts of 
iffer a introduced through the definition of the two types of parameters 
tests » adically from those employed in the psychometric theory of mental 
‘ Rasch’s work attempts ‘a new approach to test-psychology ’. 
0 date, Rasch’s work has attracted little attention. Besides Rasch’s own 


8s (1960, 1966) the authors have sighted only one unpublished paper and 
then, how well Rasch’s model 


be briefly the model and 


Writin 

0 
revi, ক 

fits En ew of Rasch’s work. Before examining, 

Rasch irical data, it seems necessary first to descri 

eG of the model. 

to each ibe is applicable to any situation Ww 

a number of items, responses being T 


here a group of individuals respond 
narked either right or wrong. 


J. Anderson et al. 
232 


but 

lized form, 

del in more genera enlateds 

has recently presented the mo are postulated: 

«a ate here 13 with Zero-one data. ‘Two Life ability 5 

the bility level of individual i or of group of individuals 2 5 Le mondtoRie 

i i i J. Both parameters are assu 

difficulty level of itemj. Bo P le I 

and 5 Fico Of ability and of difficulty respectiv oo values of the para 

eh makes two assumptions. F irstly, that given t e hats probability 
meters, all AT SWers are stochastically independent. Secondly, tha 


Ml the ratio 
ing i j i ction of 
of individual i (or group 1) answering item j Correctly is a fun 
aildj, i.e. between the d 


f the 
values 0. 
difficulty of the item; but the Probability is not dependent on the 


. are 

necessary properties of the চক পা to 

0 1 as aijd;, theoretically at least, has the the prob- 

he function (1+). Hence, 

ring item j is given by: yd 

Py= ald 2 i ‘ 
1+ aild; ai+ dj 


Orrectly answe 


Similarly, the Probability of individual i not answering item j correctly 1s: 0 
1 Py =djf(a +d). 
From eqns. (1) and (2) Rasch derived his basic model: (3) 
Pyf(l-P;) =aild,. 
To check th 


control which, h 
nature of which 
it is assumed t 


g simple 
€ validity of this model Rasch Suggests a RT exact 
mits rests upon approximations, a Firstly, 
little is known, Again two assumptions are ma individua 
€ of the ability parameter a for an d that the 
is his raw Score. Secondly, itis BRE correct 
fliculty Parameter dis the ET model in 
Tesponses by individuals doing the test. Keats (1967) explains ad items 
i ered groups of Subjects define the SNS in group * 
entries are the Proportions Pi; of SUECT tte the 
em J correctly, For such a matrix it is possible to es 
Probabilities in, and (2) and from eqn. (3) thus obtain: 4) 
Pyy/(1 7Py)a aid. 
ign is used b 
classification of subj 


; r 
Pij=0 or 1, itis ক লা edn 
id with: OF to group Subjects and/or items, bu J 
d within Some Approximation. istic trans 
y ms of both siges of eqn. (4) and denoting this logis 
formation by lij we Obtain: 


5) 
lyalog {Py _ Piyj)j}s log a, log d,. 


f 
s degree © 
gree of ability of the individual and the 
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The logisti 
the Goren Les Ee matrix is shown in Table 1. For all m items in Table 1 
for the first দি it el Sl of subject groupings is the same. For example, 

Ll 

and the de l,j =log a, —log dj — (log as— log dj) =log ai —log as, (6) 
approximate! Sa between the left-hand sides of eqn. (5) should likewise be 
if we plot to the difference in eqn. (6). Graphically, this means that 
and the interc against row 2, points will lie around a straight line of unit slope 
cach ability Rl on the ordinate axis yields an estimate of eqn. (6). Because 
estimates Ob So based on relatively few observations, the precision of these 
Y plotting ah comparisons of pairs of the n groups can be increased 
| which data Fe পা! group against the mean of all ability groups. The degree 
e goodness oF to fall on straight lines of unit slope is in effect the test of 
t of the Rasch model and, if achieved, column means provide 


Sstimat 
tates of the di ; 
Provide estimate ST of items plus an arbitrary constant while row means 
Tiss s of ability groupings plus an arbitrary constant. 
i Lio 
SE "TRANSFORMS OF PROPORTIONS OF ORDERED GROUPS OF 
JECTS GIVING THE « CORRECT ’ RESPONSES TO ITEMS 
3 Items Means 
fd Tosa Bd ee FE কুন m 
1—logd, ... log a, —log d; ... log a1 — log dm log a, —log d 
2, 1 i 
Og as —log d; ... log aa—log dj ... log aa —log dm log as—log d 
Sub; : : le " }! 
Jects i ৰ " 
log ai —log d, ... log ai —log dj ... log ai —log dm log ai —log d 
a = j | ৰ 
log a,—log d, ... log an—log dj ...- log an— log dm log an— log d 


M = 
ন eans loga —logd, ... loga —logd; ... loga —log d;j 
TOOk. হ a a 
mode] by (1965), using two different ability levels (school grades), tested the 
eans of both verbal and non-verbal items from the Lorge— Thorndike 


lligen 
ce Test and found that 27-2 per cent of items checked failed to fit the 
did not use items that were com- 


te 


ode] 
Pleyc, 2 the 0-05 level. Brooks, however, 


e 
y 
a to both ability groups. 
del has also been tested by Kearney (1966), who found that 20-4 


ber Cc 
he 0-05 level (4-08 per cent at 


ent . 
the 0. Of items tested failed to fit the model at t 
level). 
n nei 
Meet the study were the indices recalculated after 
Sion of নল were removed, a procedure suggested b 
In t paper. 
lowing F present study the Rasch model 
i ich of EOEeses (1) Item difficulty ind 
Muenceg po ate based. (2) Indices of item 
y other items in the test. (3) Item difficulty indices are mo 


items that had failed 
y Brooks at the con- 


was examined to investigate the 


ices are independent of the sample 


difficulty are not substantially 
re stable 
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When only items that fit the m 
are not influenced b 
indices are independ. 


i ন ifficulty 
odel are considered. (4) Indices ng Jevel 
iti ili ings. (5 
the composition of ability group 
Ce of the ন On which they are based 


2. METHop 

‘The test used was a 45-item s 

used for Screening recruits Who ap 

Royal Australian Navy. It is use 
for officer training courses. 

The test results of two sam 

of 608 recruit applicant. 

time system of milita 


est that is 
i i intelligence type test he 
1 omnibus intelligence } Re 
লগ Join either the Australian ee arply 
d for all applicants except those 


consisted 

ples were investigated. ‘The first oo is a part- 

i Military Forces (CMF), be ts to the 
» 874 recruit on: General 

Royal Australian Navy - The second sample টে CMF) and 279 

Service recruits, 129 Reservists (the Navy equivalent o 

recruits to the Women’s i N. 


it-half method on a 
’ USIng the Spearman—Brown split-half me 
sample of 507 recruit applic. 


Kuder-Rich 


» COrrespondi, 
mine cut-off Points. Be 


trix was fu 


Or item was deleted. 


fier, 57 
nall 12 test items 
rounds did not fit the model. 


ion matrix. 

1S reduced 6x33 matrix was Converted to form a PrCPOr forms 

‘The logarithms of P/(1-P) Were then calcula ed. The logistic 

(lj) for the CME Sa in Table & 
le values and abili 


I 
anne 
in the m 
ErOUp scale values were calculated in 

he Previous Section, ie 

Was subtracted (this is j 


a constant 
0 Wg Bil oohnnn means 

S1s just a question 

hese results for th, 


mple a 
BOodness Of fit of 
graphs of items (i.e. the Columns of Tab 
the row means of Table i 


re calcu” 
SET : the © following Brooks (1965), tests we 
lated to Investigate Whether th 


ions £0 
ৈ ° regression of the six logistic ie trea 
each item departed Significantly from unit Slope. It had been hope ntaine 
each of the RAN Subsamples Separately, pit too many cell entries co 
Proportions of zero OF unity to make th 


tly 
uen 
€ analysis worth while. Conseq 
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TABLE r 
2. LocisTic IT'RANSFORMS, ITEM AND ABILITY SCALE VALUES FOR CMF 


j 

5 1 2 3 4 -] 6 means values 
6 NES 1-355 1-672 2.076  1:663 1813 1-541 0:0546 
7 05 4 0:785 0:805 0:915 1-405 1-505 0-933  0:2213 
§ 0 i 1-355 0.934  1:362  1:505 1633 1-220 0:1146 
10 03 0:785  1:°260  1:362 1405 1633 1154  0:1330 
ES 84 0.785 0.845 1-146 1-322 2117 1.067 0:1629 
1 073 0-384  0:527 0-996 1322 1-633 0:792  0:3062 
13 a =0:186 0-073 ০0:269 ০0:570 0:926 0:232  1:1120 
14 261 _0:186 0:407 ০0.580 ০0:926 1-405 0:478 0:6310 
15 70184 0.212 0-637 1041 1086 1813 0-768 0:3236 
16 70527 0-061 0-342 0:726 0:860 1505 0-495 0:6067 
1 _0:342 0:212 0-364 0-845 1-086 1-322 0-581  0:4977 
70-301 0.377 0:340 1-146 1190 1813 0-776 0:3177 
70:527 0.111 0-128 0.649 0:962 1190 0-419  0:7228 
70:342 —0:111  0:364 0-649 0.748 1-322 0:439  0:6918 
70-527 —0:212 0-281 0-726 0:962 1633 0-477  0:6324 
70.580 _0:-377 0:241 0-497 /0:892 1190 0:311  0:9268 
71.362 —0.037 0-073 0439 0:774 1505 0-232  1:1120 
24 7362 —0:536 —-0:241 0:206 ০0-590 1-505 0-027  1:7820 
25 70.934 —0.785 —-0:128 0-368 1136 1633 0:215  1:1560 
26 70580 _0.348 -_0-109 0:517 /0:892 1'505 0-313  0:9226 

27 70-768 -—0:838 —0:203 0285 0:723 1086 0:048 = 1:6980 

28 71176 —0.896 0.453 —0-014 0:393 0.774 -0'229 32140 
70845 _0.785 —0:241 0-334 1136 2117 0-286  0:9954 


SAMPLE 
Items Ability groups Item 
ব্প A Row scale 


29 
30 70934 —0:896 -0-453 0:237 0:860 1-633 0.075  1:6000 
31 TI176 1.224 -0-453 0131 0-723 1-000 =-0:167 2:7860 
32 71672 _0:838 —0.036 0-403 1086 2117 0-177  1:2620 
33 7176 _0.961 -0:385 —-0:041 0:286 0:723 -0'255 3:4120 
34 L041 _0.785 —-0:430 —0:269 0-053 0-426 -0:341 41590 
35 71672 _1:845 —-0:934 -—0:915 —0:460 0:160  -—0'945 16-7100 
36 71176 _o0.961 -—0:241 0:301 o0:570 1-405 -—0'017 1-9720 
A 37 L041 _o0.896 —0:580 —0:301 -0:013 0:802 -0'338 4:1300 
_0:286 0:272 -0:792 11:7500 


38 T1672 =1:224 -—1:260 -—0:580 
1.672 _o0.896 —0:699 -0:673 -0:620 


lump means : 
j ~0:670 _0.286 0-077 0468 ০0:780 1-301 
ity Scale values: 

0.113 0.273 ০0.630 1-549 3117 10:520 


— 0-291 —0:792 11-7500 


the 

for Ee RAN subsamples were treated as one unit. Scale values for 33 items 
artur MEF and for the RAN samples, together with the significance of the 
5 © from unit slope, are presented in Table 3. 

Sam] test the first hypothesis, scale values derived from the CMF and RAN 

j A coefficient of 


es El 
Were Correlated, using the product-moment Statistic. 


dep 


236 J. Anderson et al. 


g as ia r items 
7=0:958 was obtained, evidence for Rasch’s claim that scale values fo 
independent of the sample. AG it slope 
SU io eat the second hypothesis, the items that departed Re Pp <0 
were removed. ‘This was done separately for those items significan obability of 
and those significant at P<0.05. It is to be noted here that the pr' 


TABLE 3. (a) ITEM SCALE VALUES 


; FROM 
AND SIGNIFICANCE OF DEPARTURE 
UNIT SLOPE. 


থ্‌ TO 
ENS FAILING 
(6) ITEM ScALE VALUES RECALCULATED WITH ITEMS F 
Fit THE MoDEeL ReMovED 


CMF es 
EEE 
sae = ===; Scale Scale values 
values Scale values values Seale 
Tiers of all th of all ‘ <0'05 
items <0-01 <0'05 items «001 
5 006 2:32 006 0-05 0-04 425* 0:04 
6 0:22 3.50% 0:23 = 0:20 rg — 0°05 
1 02 Ge om 007 095 007 OY 
8 013 455% 014 = 0:10 3:40* 0:10 — 
10 016 122 047 0414 0:22 BTL 0°25 
1 03 242 032 007 0:36 172 0:35 2 
12 1-11 7-52%* — 111 টল = — 
13 0:63 166 0.67 05s 0-64 2:81* 0:63 — 
14 032 040 0:34 0:28 0:40 3:80* 0:40 0:45 
1s 061 052 064 053 0.64 2.02 0:64 ৯ 
k ‘02 0 - - ES: 
2 0 দত 53 0:44 0:53 4:65 


034 0:28 3 037 9% 
J k 0-37 0:35 - 45 
18 0:72 1-47 0-77 0:63 0 0-51 0:64 Ro 
19 0-69 Oat 64 ১ 0-5 
3:04 0.73 0-76 2.57 0-75 
20 0:63 2-75 0-67 0-56 


0.81 047 0381 nile 
52 098 ot OE 08 ১ I 122 9 
224 1 124 018 0:94 
1 146 148 0-98 k 04 1:33 20 
ঠন্‌ 1 4 135 1 # 1:2 
5A 4-H 172 a0 Im 
25 02 AS TG = 089 327% 087 OT 
26 170 033 098 0381 TO02 gua HO 08 
27 3.21 58 180 149 1-55 2-70 1:53 2:20 
28 1.00 2S 3:40 2-81 346 1:27 3:12 SI 
29° 160 ZF go; 085 7:83%*  — i 
30 2.79 a 258 1:40 183 13-34** — — 
Te SH BH RE- 
y 0.17 [ DT) Y Kk eo 
HO 416 gapes TS 2:99 HE - 455 
EEE OAH: y+ 
36 £13 Ga 0 165 442% 1:63 52 
37 145 oO 1229 363 505 021 498 16 
} 45 10-2 | y 44 
Oe BF BB 8 et 
* P<005 
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being signi ন 
to Gr epee ক the 0-05 level is the more stringent test, as it is the failure 
Was repeated and Tr is being tested. With these items removed, the analysis 
exe recalen late wo new sets of scale values were obtained for each sample. 
the origina] sca scale values are shown in Table 3. The correlation between 
7=0.9999 for all f values (CMF and RAN) and the two new scale values was 
his seem TE NCR 
tot fitting the s to indicate that the recalculation of scale values after those items 
j he Chive have been removed is unnecessary. 
items that fit a ypothesis was examined by correlating the scale values for those 
The difference Laat in both samples. ‘The coefficient obtained was 7 = 0-990. 
correlated ( < etween this coefficient and that obtained when all items were 
Significant, 958) is 0-032, which is in the expected direction although not 
MN 
di Rt bi en hypothesis was examined for only the CMF sample. Six 
Correlated ত Foes were formed and the resultant scale values were 
alysis, Tj those obtained for the ability groupings in the original CMF 
he correlation between the item weights was T= 0996. 


‘The fi $ 
htaineq hypothesis was examined by correlating the six ability scale values 
=0.992. the CMF and RAN samples. The coefficient obtained was 


Vas 3. DIscUussION 

Tlan i 1 i a 
ন) e from ই Rasch’s model appears to be established, in tha 
oS to EE ich the scale values were derived, nor the presence 
Sale values the model, appears to have any real influence on t 


t neither the 
of items that 
he resultant 


On 
stvel at OE arises is that there is some difficulty in evaluating the 
ele 3, Se an item is accepted as fitting the model. As can be seen from 
e ডি items are rejected at the 0:05 level in the CMF sample and 17 in 
mple. Of these, seven are common. At the 0-01 level, three are 
are c om the CMF sample and nine from the RAN sample. Of these, 
ommon. 
ralyse Pre Were 17 items rejected at the 0-05 lev 
Sjecteg » USNg the different ability groupings. 
ahi 0-01 level. 
ig Shes Y, these discrepancies appear disconcerting. The difference in t 
onl Veen all items on the CMF sample analysed by both ability groupings 


থ্] ট ss 
OEE er Ee It appears that the difference is small, but is sufficient to make 
to this ren significant at the 0-05 level and four at the 0-01 level. Part 

0 Coarse. ase can be caused, as Rasch suggests, by the ability groupings being 


Te; 
t SA 


el when the CMF data were 
Of these, seven items were 


ce used by Brooks (1965) is 


lis more appropriate when 


e fit of the model. If the 0-01 level is used, 10 per cent of items fail 
S.P. 


1S 
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N sample. 
fit the model for the CMF sample and 33-3 per cent for et ke" with this 
i -' be noted that the test was not designed to fit Rasch’s mo 
Itis Han the fit could be considered as reasonable. Ge ee RAN 
model does not lend itself to small samples, ক hnidue 
subsamples were considered too small for application of this te 
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AN EM 
EMPIRICAL STUDY OF GENERALIZABILITY COEFFICIENTS 
FOR UNMATCHED DATA 


| By DaAvip M. SHOEMAKER 

J Oklahoma State University 
and H. G. OSBURN 
University of Houston 


bje EC ting within the framework of the theory of generalizability, the main 
{ >Jective of the investigation was that of empirically studying the effect of item 


d 
in a item ‘content stratification on the gamma coefficients using an 
of an wu ed item design. ‘The principal results were as follows: (a) the reliability 
than Re AE test using the unmatched item design Was found to be lower 
(0) stat ity of a comparable test using the matched item design, and 
in the I cation on item difficulty was found to be an extremely important factor 
coefici nmatched item design. These and other results obtained on the gamma 
clents were contrasted with previously reported results for the alpha 


coeffici A ্ি 
Clents involving a matched item design. 


1. INTRODUCTION 


(Crop on bach, Rajaratnam and Gleser have developed in two major artic’e 
Whi ach et al., 1963; Rajaratnam et al., 1965) a theory of generalizability in 


random and stratified random sampling of items is presented as a less 


Ive alternative to the assumption of equivalence of parallel forms in 


ass ট 
maj ‘cal reliability theory. (It should be noted that, while the present study was 


Y Concerned with the sampling of items, the theory of generalizability also 


৭ ি 
Plie + D 
‘relia DN the sampling of observers, 0ccasions, etc.) According to their theory, 


) k eae ’ is synonymous with ‘ generalizability °. An individual's test score 
San regarded as interpretable in its own right; rather, it is regarded as a 
he had rom a universe of test scores that the individual would have received if 
Drigte been tested over all possible random samples of k items from an appro- 
tes Population. Any conclusion an investigator draws concerning a particular 
dete, ie will be generalized to the universe of comparable test scores. To 
los, Lone the dependability of such a generalization, he must determine how 
the j.2 Particular test score agrees with the result to be expected from testing 


the eosividual over all possible samples of k items from the item population— 
ny yon between test score and true score. E 
eri € theory of generalizability two coefficie 
aL le estimates of the correlation between test s 
And, gy Coefficients has two forms—one appropriate 
appropriate for the unstratified item case. 


articles 


strict 


nts, alpha and gamma, are 
core and true score. Each 
for the stratified item Case 
The derivations of these 
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jaratnam et al. (1965). 
coefficients are given by Cronbach et al. Se Lorne BT ae matched, 
The alpha coefficients are the ER ie Re random oS 
ন be 9 2 woos eendia coefficients are appr RF ee ্্ 
EE Jans Case where k test items are randomly selected for 
ee Ep ntly of the items selected for every other person. 1. (1965) have 
ER empiri study of the matched data case, Cronbach et a alpha con- 
shown that, for tests constructed by stratification of Ee ated provides a 
siderably underestimates the true reliability whereas alp কি CEES onstrate 
good estimate of the true reliability of the test. However, fo 


( 
imate of the tru 
by stratification of item difficulty, alpha is almost as good an estimate 

reliability as alpha-stratified. 


large 
HS ces a 

In other words, Steal Tication prose is of item 
discrepancy between alpha and alpha-stratified when the strati dieultys 
content, but a small discrepancy when Stratification is of item 

The importance Of unmatched data e 


Xperiments 
he technique of collectin 


score of an indivi 


d da 
hether matched or unmatche 

are collected, If € Wants relati 

required. If i 


are 

d data 

Ve scores for individuals, then EE propose 

he wishes to estimate the mean for a group of persons ফি referrecr 
items, then unmatched data would be P 


+ The present stugqy deals with the unmatched data case using a des tudying 
similar to that used by Cronbach €t al. (1965). The objective is that of fen as 
the effect of Stratification on the gamma coefficients using unmatched oh 
compared with the Corresponding alpha coefficients using matched items. 


2. THE Gamma AND Gana 


STRATIFIED CoEFFICIENTS of 
Person 7, responding to item i, 
LKpt Over all k items is hi 


m 
he su 
Carns observed score Xp. T 

1S test score Kt. 

defined as the mean of Xj; Over all items i 
Apt over the family of tests. For the unmat, 


lp 
Ore 
Person p has a NE SC ean ধর 
n the universe; RMy is Uf coefficie 
ched data case, the neede 
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f Eenerali iit 2 h 
izabili y is 2 iti Ere eac 
1 Pyryx f for data from all conditions pooled. Where 


Person is 

obser ee) চক - se 

Within-persons fl ঞপস conditions, mean squares for persons and 
abil; a ained by a one-w i rari i 
bility of the totul Co y a one-way analysis of variance, and the generaliz- 


ee 3 =(MSy, — MSwp)[MSy. (1) 

r dichotomous data th i i 

সা সদ“ কিক formula reduces to an unbiased version of the 
ll k 1 kpl-p 
Efi I 2) 

Where k ref. Wl Sy 

. € i 
Variance of ie to the number of items, 1 to the number of people, and st? to the 
ধ the total test scores for persons. 
e 
Oormula for the gamma-stratified coefficient is as follows: 


Kud 


h 
bade SknMS wp) 
}s=1- SE (3) 


This 
হু * 
quation has a special form for dichotomous items 
LE! = a 
> শল i n(kn—ZX.n)—-(n- 1)sn2] 


hls LE — =  — = 
ns i 


he (4) 
ere ¥ 

Rh .h re 
or; fers to the average score over persons for stratum h, kp to the number 


tems i 

In ৰি 

* all ihe oe h, sn? the variance of the stratum h scores, st2 the variance 
Mhiased - and n, the number of people taking the test. This is an 
ariance ’ ratified version of KR 21 using the mean, ZY.n, and the estimated 


c 
€ of subtest scores. 


A 3. METHOD 

r i EEN 5 

test i cedure was developed that simulates the administration of a single 
to a single examinee. The result is a simulation model of great 


h items and individuals. 
buted standardized latent ability 
ng an item correctly is a 


xilbil; 
ity that allows for the sampling of bot 


Gigs Ha” 
Ont. model assumes a normally distri 
e answeri 


inuy 
petmal-ogi. The probability of an examine 
sing Ee function of his ability level. Denoting the probability of person f 
€ ith item by Py, this relation is more explicitly stated as 


(C,-4,)/B 
Pas fT NOV, (5) 


Whe, 

1 = 

lS (Er 

haractert a normally distributed measure of ability, Ai and Bi are values 
ing the item, and N(y) is the standardized normal frequency function. 
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ij for 
termined 
d Bi must be de ine Hl 
n. (5), the values of Ai an expressing 
ৰ ps er a Ls given the equations ;e jee Fe etoreenl (WE 
oy i es of the item difficulty and the biserial a are as follows: 
te Ahe underlying ability continuum (Ri). His resu 
item ani 


(6) 
Ri=(1+ Bi), 


ee oe by 
: i inees 1S given 
‘The item difficulty level, Mi, over the entire population of examin 


ছে (7) 
Mi= || NO) dy, 


distributed variable and Hi= AiR. lue; Bi was Eo 

In the present study, Ri? was set at a predetermined ন randanily selecte 
ntly determined. The item difficulty parameter Hi; wa tly determined. 

quently SEE TE Ad As Was consequently Lawley 

from a normal distribution of Hi; Values, an ‘ Shae 

istri $ Was assigned zero mean and unit ve 

(Mi; Values). 


ifficulties 
কপ ন difficu 
Assures a rectangular distribution of item 
For sin 


gle ability tests, setti 
the inter-item tetrachoric co 
Rif? and Hi, it was Possible t. 
A test, then, 


where Hiis a normally 


ys akes 
ems md 
ng Ri* to a particular value for all Te given 
Trelations a Constant since 7; = RiR;. 


Tf the test 
1 was computed 


Where Fi and Fi; are rand 
tion and Ww, 
inter-item [u) 


m 
ithin-stratu 
Such a way that rw, the within Li 
Predetermine 


se He 
্ ys elation, 
» and 7, the between-strata inter-item corr 


|!) 
d values. on ea¢ 
€ Computer Simulation model assigns a score of unity or ting 4 
test item for every indivi i . ‘This is done by OT ity wi 
Ability of the jth individual Passing the ith item, and selecting if 
Probability P. Probability 1 _ p,,. hetical test, 
ion model described above, hypot contai f 
Were constructed According to the following Specifications: (a) each test an' 
18 items, (b) each test had three Strat 
(d) the Probability of p 


a onS) 

E ix item? ge 
চ 4, (C) each stratum contained si wi 
of i Assing each tes 
range of ite. 


.99—a 
t item ranged from 0-01 to 0-99 
es. 


e8 
valu 
i iately setting the V™ nd 
k Accomplished by appropriate y stratum, * 30, 
Choric correla: between items within a 


Tw = 5 
Across strata. Values of rw (rw 5, 
» 0-30 and 


nd 0.50) and sample size N= 


foes 


VA 
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100, 1 ৰ 

ন চত কা লৰ manipulated for tests stratified on content and on content 
to difficulty lo ) en tests were constructed with items stratified only according 
Because of he , 7b =7w; and only values of rw and N were systematically varied. 
data, there ER amount of computer time involved in generating unmatched 
tit end Nr Cr five replications of each combination of stratification pattern, 
inlegration bos he necessary values of pirx® were computed by numerical 

ased on equations given by Lawley (1943). 


চ 4. RESULTS AND DISCUSSION 

e ্ি 

E(p,,2) he sees study focused on three general questions: (1) How does 

i the ্ু I iability of tests employing the matched item design, compare with 

Stratification iability of tests using unmatched items? (2) What is the effect of 
nin unmatched item studies? (3) What effect does stratification have 


On the sam lin সণ 
pling variability of gamma over samples of examinees? 


EE 
ৰ 1. CoMPARISON OF MATCHED VERSUS UNMATCHED TEST 
XPERIMENTS: RANDOMLY SAMPLED EIGHTEEN-ITEM ‘TESTS 


tw put Pur 
(From Cronbach et al., 1965) 

oul 0-961 0:898 

on 0-913 0-840 

020 0:860 0-775 

30 0-757 0-656 


lege 1 Presents a comparison of average pan? With purx fOr unstratified 
Magnitude cr at various values of inter-item correlations. This table shows the 
e magnitud the difference. In general, as the item intercorrelation decreases, 

ade for t e of the difference increases. The same conclusion can also be 
the qata ests constructed by stratification of item content, as can be seen from 
only op EEE EDO in Table2. A comparison of pun> and Pug for tests stratified 
OWer th ontent shows that test reliability for the unmatched item design 1s 

an the reliability of a comparable matched item design. 


SUS UNMATCHED TEST EXPERIMENTS: 


Tap 
LE 

2. CoMPARISON OF MATCHED VER 
ITEM CONTENT 


TESTS STRATIFIED FOR 


(Da 
ta 
On pyr, ws and «x from Cronbach et al. (1965). Ys: and y are averages based upon 


five samples of 200 cases each.) 


ry 
100 i put as a pax Ys Lt 
0S 0 0:866 0-860 0-758 0-747 0-736 0-641 
050 0-765 0-757 0-668 0-639 0-654 0-582 
30 0 0-664 0-656 0-579 0-535 0-533 0-464 
0 0-507 0-501 0-442 0-389 0-412 0-362 
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হু ients for tests 
iti omparison of alpha and alpha-stratified se getaot fied 
In addition, c HE with the corresponding gamma an. nd EO 
ECL EC oR ioe shows that the discrepancy between gam nha and alphas 
soeficients eS €e magnitude as the discrepancy DERTEEn Bt dle ain 
SE UM a stihcal oy for item content seems to have 
EE ক matched and unmatched item designs. 
effe 


TABLE 3. COMPARISON OF Mar 


NTS: 
Mae {PERIME: 
CHED VERSUS UNMATCHED TEST Ex 
SINGLE-FAcTOR Ts 


J TY 
S STRATIFIED FOR ITEM DIFFICUL 


ls] 
based upo 
5 d y are averages 

2 d «x from Cronbach et al. (1965). Ys an 

Haid i inde oid five samples of 200 Cases each.) 


7 
2 Ys 879 
2 Xs u PMX 0 
Prt .962 ue 
0.978 0-970 0.942 0-977 I 0816 
he | t 0-918 0 0-7 
06 0919 0.915 0:902 
0:50 0:863 0-859 0-851 0:862 ot 0 
0:30 0.760 0.754 0.752 0-760 


jy ifficulty 
rE hig item diffi 

The results are quite different for tests Stratified in terms খল for tests 
as shown in Table 3 Comparison of Pan With pry A item design 
stratified for item difficulty, the test reliability using an unmatc 


Also, 

. 1m. . 

bility of a comparable matched Ie, i> coeffici" 
alpha-stratified with the corresponding ga 


2 while 
. i S Puy 
Shows that gamma Considerably underestimates pi for 
timate of Pu TH 


ক ification 
iain to be made is that stratification yy 
ems to make little difference in the matched item 

1 an unmatched item experiment. 


ents in Table 3 
alpha is a Eood es 
item difficulty se 
is all-important i 


TABLE 4, CoMpPaRIson OF MarcHEp VER: 
'TEsTs STRATIFIED FOR B 


NTS: 
রি RIME: 

SUS UNMATCHED TEST EXPE 

(Data on Put and A 


ENT 
OTH Ire DispicuLry AND CoNT 
from Cronbac 


[1 
ed up 
h et al. (1965). Ys and y are averages bas 
Ve samples of 200 cases each.) y 
rw 7 pu as Puzx*® Ys 0 
1:00 0 0.926 0.912 0.935 0:924 059 
0-70 0 0.787 0-780 0.789 0-794 04 
0:50 0 0.675 0-669 0.676 0:695 0” 
0:30 0 0.512 0.508 0.513 52 
Table 4 


Jt)" 

ifficu 

T both content লি ক is be 

At stratification for item diffic condi of 

: iment, and that under these liability 
matched item tests is quite comparable to the re 

tests using the Conventional matched it 


em technique. 
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‘Tas 
LE 5, 
Mobs os a ED STANDARD ERROR OF Y AND ys AS A FUNCTION OF THE 
ATIFICATION, THE WITHIN-STRATUM INTERCORRELATION AND 
SAMPLE SIZE 


rw = 0:30 rw = 0°50 rie 070 tw = 0°99 


Dificulty Ss Y Ys YF Ys F Ys ¥ Ys 
5 0-139 0:079 0-069 0-039 ০0:032 0-011 0-027 0:007 
is 0-101 ০0-050 ০0:027 ০0:014 0032 0-016 0:028 0:006 
> 0:084 0-041 0:033 ০0-021 0-030 0-017 0-020 0:004 


Stratification N 
150 0:044 0:022 0:026 ০0-015 ০0-013 0:009 0-019 0-004 
0-039 0:023 0-029 0-018 ০0017 0.011 0:015 0-003 
0:009 0-002 


200 0:030 0:020 ০0-025 0-015 ০0-011 0-008 


0-179  0:186 0:049 0-046 0:049 0-039 
75 0-116 0-070 ০0051 0:063 0-045 
0-119 0-106 ০0:079 ০0:078 0-063 0:042 0-054 0-036 


100 0.127 0.115 0.066 0:060 ০0'050 0'037 0:080 0-048 
0.031 0-049 ০0:041 0:070 0-044 


0.032 0-046 0039 0:048 0-031 


Cont, 
0b 25  0:153 0138 


7 = 0.0) 
50 0.129 0118 0113 


150  0:102 0.094 0-035 
200 ০0:081 0.075 ০0038 


0-031 0:055 0:024 
0.035 0010 
Tp 0.098 0-052 0-100 0:049 ০0066 0.031 0-046 ০0015 
5 0.059 0-036 ০0093 0:049 0:064 0:032 0-044 0-014 
on 0.039 0-024 ০0-098 0-050 0048 0023 0-032 0-009 

0-007 ০0-009 ০0-071 ০0:033 0:041 0-019 0-018 0-007 


Conte 
a nt~— 
piculty 25 0:260 0-125 0296 0-140 0-070 
7= 0.0) 50 0.152 0-069 0-150 ০0:070 0:094 0037 


Fi 3 
an nally, turning to the data in Table 5 on the sampling variability of gamma 


al i 
tends ye pI, it is seen that the sampling variability of gamma-stratified 
con dition e smaller than the sampling variability of gamma for all stratification 
nl SE os discrepancy being most marked for small N and low item 
ations. However, it is clear that stratification by item difficulty 
riability of gamma-stratified. 
i 


duc. 
€S the greatest effect on the sampling va 
fication by item difficulty is 


CSe re ৰি 
sults reinforce the conclusion that strati 


ext 
emely i 
ly important when collecting unmatched data. 
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ASCO ন 
RING SYSTEM FOR MULTIPLE CHOICE TEST ITEMS 


By J. O. RAMSAY 
McGill University 


Wei 
Which nn Scores for the alternatives in multiple-choice items are proposed 
hese are th ন the separation in mean score between two criterion groups. 
for the two e differences between the probabilities of choosing the alternatives 
ventional DES oy Several advantages of this scoring system over the con- 
two Criterion are discussed and the method extended to the case of more than 
Eroups. 


1. INTRODUCTION 


The ; 
term ‘ multiple choice ’ usually refers to test items in which there are 


s of this paper, the term 


le responses, including 
es are treated 


More 

“Te tha ্‌ 

will ese possible responses. For the purpose 

items in i broadly to any item with two or more possib 

Collect; ich one response is correct and all other respons 
Vely as the wrong one. 


he 
Unity to ea universal method of scoring these item: 
i € Correct response and zero to all of the others. There are two much- 


| disc 
| attain টি Problems with this scheme. First, the mean score that can be 
knoyn. ese is large, and the laws which guessing obeys are not generally 
0 provisio nT closely related disadvantage is that this scheme makes 
tong or ee partial knowledge, for the fact that some alternatives are more 
ese di GR ight than others, or even that more than one answer may be correct. 
de Binet ee have been discussed at some length by Chernoff (1962) and 
Sc Although it is of obvious importance t 


eme 
Tha i much test development does or ca 
ls, a pool of items may have been given to some body of examinees and it 


Le to devise an efficient test on the basis of their responses. There is 

0 grou € criterion available which can be used. to divide the examinees into 

So ft at টা Bi ‘The test should then reproduce this division as well as possible 
€ criterion need not be used on future groups. 


s is the assigning of 


0 arrive at a suitable a priori scoring 
A n take place on an a posteriori basis. 


oft 


S 2. PROPOSED SCORING METHOD 
(grou Uppose that the population of examinees can be separated into two groups 
dicated by T, and that thisis a 


Si ne 1 and group 2), that a total test score is in 

the li of the scores, Si, obtained on 1 items. Suppose also that each of 

tem 7 ternatives in item i has attached to it a weight, wij, and that the score on 
1s simply the weight attached to the alternative which has been chosen. 
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ns Ga ; is Pijlr), then 
If, in group 7, the Probability. of choosing alternative jin item iis Pi(j|r) 
the mean item score is given by 


J (1) 
E[Sil] =X owiPi(j hr). 
$ 
The fact that it is acce 


d পদে WEE tween 
Ppted practice in statistics to discriminate be 
two groups on the b 


gests that the weights, wij, sigh 
means of the two groups by as much as pos 

ze the following simple criterion: 0) 
©=(E[T|]- ELT]: 


asis of their means su 
chosen so as to separate the 


That is, they should maximi 


subject to the restriction: 


(3) 
Xu =C. 

$a 
Tf eqn. (2) is rewritten: 


2 


9-[ যয i) -2 3 ey) 


[yet ng], 
eo *. 3 
and it is noted that thi i 


নল্স [P(Gh)- Py; 2)]2 
“IP + Spy 
3 3 


_22P(G)P 2) 
<¥Pl): + pg 

bl [32 DE [32 52]. 5) 
<2 ” Mt 


tem! 
alys 


This latter result is of importanc g 
Separates the means of thet € Since 
purposes. 

Argument (5) 
by an item occurs 0 
and all the membe 
the best multiple c 


ৰ Ii 
Xjwy isa measure of how we a 
0 Sroups and hence valuable for item 


also implies that the 
nly when al 
TS of the ot 
hoice item 


0 groub, 
Mum separation of the tw tiv 


al 
tern”, 
of one group choose one al at 38! 


Ose one other alternative. 
Se is 


maxi 
1 the members 
her group cho 
in thi A 
nD this sen really a two-choice item. 4 


———_—_ 
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Criteri ; 

g eg @ is a special case of a more general criterion applicable to 

Sel the riterion scores Xr (r=, +48). If the X's are scaled so that 

the Oa en maximizing the following criterion is equivalent to maximizing 
nce between the criterion scores and the expected test scores: 


) 2 
0={ExBtT} . be 
f this Criterion is rewritten: | 


[] nk 2 
= ৯ SY wyP( iv) 


$ J 
nt [) 2 
{55 [ 3x20} : (0) 
en it i ll [SE r 
It is again apparent that the optimal weights are given by: 
[A 
Wg = aSAXrP(s |). (8) 
r 


1 
Theses 
foy Se Weights are identical to those given by eqn. () when g= 2, except possibly 
8 scale factor. 


3. DIscuUssION 


ত) UE several advantages to these scoring schemes aside from those 

| COns, (4)০ E A The substitution of sample estimates of the probabilities in 
1 a teh ) provides minimum variance unbiased estimators for the weights. 
iagnostic ts are especially useful for items of personality inventories used for 
For The. Purposes in which. the alternatives have no clear a priort weightings. 
na Mans common tests in which one answer is Correct, these weights give 
Ose ho in mean score to those who have partial knowledge and give to 
aking on Suess randomly amongst all the alternatives a mean score of zero. 
Fina, b a test of this kind could give a significantly large negative total score. 
est টি খান partial knowledge is rewarded, the reliability and validity of a 
1 Scheme in this manner should be greater than for a test scored by the 0-1 
0) una weights do have two minor problems. ‘The first concerns criteria 
if nd (6). Neither of these implies the minimization of misclassifications ; and 


A Sityati 

u ate : ; 
ups lon arises in which the increase in V. 
May be as an effect which exceeds the effect of separa 
is > Bained by this criterion. However, it seems u 


Aris 
In practice. 
titer fF Second problem is the danger of perpetuating bad or out-of-date 
to en dividing the population. ‘To some extent this difficulty is common 
his Situat, analysis procedures, and whether or not it would be more acute in 
bk Gens is not clear. ‘This underlines the necessity of continually updating 
ia. 


ariance of the scores in the two 
ting the means, then little 
nlikely that this will often 
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the 
In summary, if the weighting of an alternative is ee লোন ট 
difference between the probabilities of choosing this alternative ক * nt 
groups, then this is a scoring system which not only involves ঢ) ol generally 
of computation and lends itself easily to item selection, but it wi Ri of the 
distinguish between the two groups efficiently and improve the reli 
test score. 
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ATTRIBUTE AND RELIABILITY 


By JoHN Ross and JAMES LUMSDEN 
‘The University of Western Australia 


that ক ls of classical test theory and of more recent developments shows 
I Whese eles of true score, parallel test and reliability are poorly defined. 
detached fro oY is defined as generalizability it is not so much poorly defined as 
as such be Es the essential interior properties of the test. Itis proposed that tests 
dimension NE defined as devices which map an attribute in a single 

l logistic model which fits this conception is examined andit is shown 


that true 
| rue score, parallel test and reliability can be defined without ambiguity, 


inconsis R - E Hie”! 
SORE To re or circularity. Deeper investigation reveals that the true score 
| is not only unnecessary as a basis of reliability but also a distraction. 


Reliability i. 

that le 1s formulated as a characterization of the test's mapping. It is shown 

there € mapping can be characterized for the model and it is also shown that 
Ss a distinction between information loss due to regression and error which 


IT é নী 
Probably lies behind the poorly conceived notion of true score. 


1. CONTEMPORARY VIEWS OF RELIABILITY 
hich the estimates 


Es Ee reliability problem is to determine the extent to W 
Solution চি est might be in error. There have been numerous attempts at 
that tt A back to the work of Spearman (1910). It is shown in this paper 
lutions tp far have been unsuccessful, although they have produced 
y resteoth other related problems. A solution to the problem is then presented 
ensure Ing the problem to tests which can be considered to be devices that 
teliabilj an attribute. “The solution is illustrated by a method of describing 
[S ity fully for itemized tests which meet the requirements of a logistic item 


AT; A 
Etistic curve model. 
ir: é H 5 3 aun 5 
St, an examination is made of three relatively recent positions which, 


ety: 
cen them, span the range of modern thinking on the subject. 


Class 

Usical test theory 

yee test theory, by which we 
te, SEM (1950), bases its approach 


roughly mean test theory as codified by 


to reliability on a theoretical analysis of 
ts of true score 


St: 
(0): It constructs a simple model using the theoretical concep 
va Parallel forms (PF) and error (E), in terms of which observed scores and the 
score variance is said to be the 


ariat; a 
চট ls oe in them can be explained. Observed 
rue ore = score variance and error variance. I 
ctween Ea and error Variance, and is estimate 
ity ET el forms. In terms of the model it can be s 
ient 722 is given by 
T2z2= ot (ot + 0¢°), 


Wh, 
ere ci? ij a 3 
t2 is true-score variance and oe is error variance. 


Reliability is defined in terms of 
d from the correlation 
hown that the relia- 
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টট kl ved 

‘Two of the three theoretical terms, T and E, are tied আন ক ও পা 4 

score X, by the formula X= T+ E. The third theoretical term, PFE, 

in terms of T and E by the following conditions: t equal the 
(i) Tig=Tin [fy The true score of any person on one test "9 11). 

true score of that person on the other Parallel test —Gulliksen, 1950, Ten oie 
(ii) Seg= Sen (‘ For two Parallel tests, the errors of measuren 

equal "—Gulliksen, 1950, Dp. 12) | el foun 
It is impossible to apply the classical model as it stands, since para 

are defined i 


| রি " vith the 
In terms of T and £, to which there is no direct access, W 
result that observed Varianc 


ical 
§ eoretic 
¢ cannot be decomposed into the required th 
Variances of of and oe2. “ 


3 t of 
An application of Sorts is achieved by buttressing the model with a se 
criteria for parallel forms 


A set of tests is said to be parallel (Gulliksen, 
ch. 3) if 


rmore, 
2% OF oto? + ce°). Furthe 
be expressed as a limit 


i XK 
T= lim > (XK. 
9=1 


0 
two 
Rk We Can alw small subset, say tw 
three Nn number, with the same nd 


he 
whet 
€ tests in the Subset were really parallel or w, Be 
1 
€ had so Testricted the Subset that Ton-parallel tests appeared PEE : ' 
needs to be Some rule of test th 


selection, i 


[+] 
sy 
[= 
Es 
[5] 
[=] 
[4] 
[=] 
nn 
ot 
[5 
0 
[a 
“nn 
2 
fe 
ঢ 
2. 
[] 
nn 
] 
= 
ত 
>» 
in 
চে 


ন may 
:“t more than one equally acceptable basis lel 
Set, which ‘Would mean that a test would be Ps ha 
ও different reliabilities, If so, it would ing up” 
shifts in expert opinion “tution, in terms of T and E, dependi 
The fundamental deficiency in 


্ o 2 a ly the 

of any basis within the definiti nalysis is not simp f test 
i On of le 0 0 

Which the Statistical criteria are applied Ls Re EE he theo, 

0% the formation of the set, T is necessar i han 3 oh basis, and to ys 

it in the theory if the theory is to Provide « Y to have suc a 5’. the and 

of reliability. Benuine foundation 


Jack 


the Classical a s to 
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Some theorists have been drawn to the conclusion that the reliability 
te must be ‘re-phrased as a question about generalizability. We will 
Hb Sider this view in the next section, and show that it amounts to abandoning 

€ problem of reliability in its original form. 


Reliability as generalizability 
Cronbach et al. (1963, p. 144) have redefined the problem of reliability: 
‘0 investigator asks about the precision or reliability of a measure because he 
see to generalize from the observation in hand to some class of observations 
06 Which it belongs.’ They pin reliability not to a test, but to the universe of 
nditions over which an investigator wishes to generalize. There are thus as 
Many reliabilities for a test as universes that investigators are able to specify. 
ain The position reached by Cronbach et al. is an important generalization and 
ratio el, tension from previous theoretical studies. ‘The authors say their 
allo nale 1s one which combines numerous lines of thought about reliability and 
‘WS reinterpretation of them. | 
val ‘The ‘liberalization’ of reliability theory that Cronbach et al. offer is 
“uable for a variety of practical purposes. Ina given administrative context, 
ere might be a well-defined universe of conditions (such as the raters available, 
5 alternative test forms available, or the testing locations available) and the 
i SStigators may well wish to know about variance in estimates with variation 
an: Any or all of such ‘facets’. But the investigator who requires a theoretical 
sis of his test, one such as classical theory attempted to provide, will not 
it, 
| theo, Cronbach et al. present a model which might be taken to be the basis for | 
| i retical analysis. The model accounts for scores obtained by an individua 
Sidi, Condition of the universe in terms of a universe score and several 

* fy Ary scores. The universe score is defined as the individual s score on the 

pst centroid factor of the covariances between the conditions in the universe. 
t be Subsidiary scores are scores on other centroid factors required to account 

the residual covariances, and an error score. 

{| ee is shown that, from the model, it is possible to estimate th 
«Ween universe score and obtained score provided three conditions are met: 
fi ( ‘The universe is described unambiguously, so that it is clear what conditions 

Within the universe. (2) Conditions are experimentally independent... 


3 

3 Scores Xt are numbers on an interval scale.’ } 
he decomposition of obtained score variance into universe score variance 
} ag. Other variance might be taken as a sufficient theoretical analysis, substituting 
Tl ately for the classical target of a true variance, error Variance decomposition. 
© difficulties with such an interpretation are as follows. Defining the 
alee Score as a factor score on the centroid factor from a covariance matrix 
lS Universe score change with changes in the scaling unit of any test the 
the ‘rse (see Ross, 1963). Secondly, if the centroid is to be interpretable from 


R efinition of the universe, the universe must be defined in theoretical terms, 
S.P. 


e correlation 


id 
i first centro 
d the structure of the test population must be such that pe An 
facial corresponds to the factor common to all the tests, and isn 

at 


et 
iti could be ml 
by secondary factors. It is difficult to see that such conditions 


চ test 
মম i iti without a 
ithout a very precise and complete universe definition, and Vv 
টি 
construction theory sufficient to e 


j e 
দি with th 
nsure that the centroid পাল ধা” ETE 
; 2 ni J 
definition. Universe score variance will not, except in most cle nest 
1 score, or at least to one whic Alyse 
ts the sample of tests for factor ana 


ich 
. ate o whic 
em, it was to determine the extent t 
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» and, as we have seen hat the ori 
the concept of universe Score. An attempt will be made to show t 
question is answerable by rest 


{ nsidered to 5° 
ricting it to tests which can be co 
Measures of an attribute. 


রথ it WOI* 
tions and applications. i, 
Universe of content. Secondly, it deals usually ad 
Or on i ntent. ‘Thirdly, it need a in the 
it does not speak of true oo 
it deals explicitly with Hoe veut 
’ ixthly, in virtue of this, it does no 3 
ily the fundamental datum of a test. Tul Score 
Onograph is Lord’s 4 Theory of on: on Ul 
€ modern test theory positio the 


and tha aif 
i Y by error of measurement 
S Zero. I he fii 


b ) 

Ain St is to assume that it is arti fe 
ৰ Tes di er 0 লা 

SEER 1S to define true Score as Nypod ie Sf erm le Fe up of est 

pool of items which have Properties the Sahe as those which snake UP Ie 

at hand. A variant of this Approach is to assume that the test is a ran or be 

of items from such a pool. € third approach makes assumptions 2°, oud 

distribution of the error of Measurement COfnpoNEAtS Of Observed scores 

their expected value Ponents 0 


ENA lac 
i Et ord goes on to make distinctions ( 
classical theory) between the true s 


Jyin 
Core of a test and the ‘ ability underly 


test ’ or lat. F -? 
liters NE Fe ১ Et has a limited range, whereas the 
Psychological diner sion» 2 portantly, non-parallel tests of the same 
ut the same latent in necessarily have a different true score metric 

Ain yeteet ch 5 continuum (or underlying ability metric). 
etic :and the as three metrics ’, the observed score metric, the true score 
Vtnotand atent continuum metric. Observed score is related to true score 
latent 0 true score is a monotonic, but non-linear transformation of the 

TT inuum. 

tions, A made about the true score models is that they make weak assump- 
Mbout es nt i latent continuum models which make strong assumptions 
OF a, Weak 0 es (cf. Lord, 1952). Some information is lost by settling 
curvilinear corr TR since one Works from true score which has a perfect but 
e that the or e with the latent continuum. Lord’s argument appears ‘to 
of the true ! ce of information loss is worth paying for the gain in applicability 
core models. 
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approach are not removed in 
The matched-forms model encounters Guttman’s 
of parallel test sets. The 
m to the item level, and 
liability to any particular 
d set of tests of which the particular 


f Tn, the same objections. 
em in a 2 s (1952) monograph, true score 1 0s 
tue score est was tied to a common attribute by an item characteristic Curve. A 
eG could be and was defined in terms of the model, not as a primitive term, 
stable in similarly could be defined in terms of the model. ‘The model was 
tained terms of predictions it made from item parameters about the shape of 
ton the Score distributions, and it provided, in principle, a basis for inference 
i Ge of item responses to location of an individual on the attribute. 
ecause Cy ty with Lord s model was the intractability of its mathematics, 
eristic ke the assumption of the normal ogive as the form of the item charac- 
CORerene The model itself, however, is theoretically sound, and allows 
reatne nt treatment of the reliability question in its original form. This 

Tt ER not involve a consideration of true score. 
irnba be shown that a slight modification of Lord’s model, made by 
Of test ন (1957) which removes the mathematical difficulties, provides one class 
or which itis possible to give a coherent treatment of reliability. 


held a less central position. Each 


{ T 2. AN ATTRIBUTE-BASED ITEM MoDEL 

sto 0 accept an attribute model like Lord’s, or the Birnbaum revision which 

that ন Presented, is to admit that there is a class of tests and a class of non-tests; 
ere may be procedures which look like tests, but which are not. An 
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is an 
i WS. There 

tt d test can be depicted theoretically as - "shure. The 
3 s boietosien 2 metric space which supports variation in 

attribute space, I 


1 
i the rea 
in! hich is usually 

itself maps the attribute Space into a test space, whic 

t tS. fi 

ALE or the integers, but which m 


or a multidimensional metric ড্ঠ 
that there is a well-defined att. 


ত s at once. 
duce is governed by several attributes a if 

ininum 
t is not a test of any attribute, BIC | ways 
Which can be checked in le since the 
del provides a second Toe a predlee 
S the response to each item a টা tribution 1s 
test scores. If the obtained dis 


t 
ন sthat 0 
an item if his location is abovértf 
mM, and truly fails if : 


me 
i . is assu 
Since continuity is as ored: 


ot 8 


f items (7 he 
i fl 
a giver fample 0 

Universe of items), 


asic model is re 


Presented in Fig 
location of a set 0. 


ef 
্ the upP (4) 
Scope PATS from linearity by চপ i as 
Upon true Score, in vireo of the discontinuities in a5 আত 
he fact that true score depends upon item Iooatiort 
On the fundamenta] conti 
Cation of items and perso 
distribution of locations. B items and cribe feds 
with which they appear in locations along the Continuum ca ele i 
Probability density function this sta nctioy only 
although the normal density function and the logistic density Tacation nt 
commonly postulated. It is further Assumed that People shift sje A 
slowly over time, but that items Shift 


person locations 
The point lo 


EEE none aa cn a SENT 
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i a b € d 
em distribs ] 
and medians ) Fi | | True Obtained 
! | |! EE সর Individual score score 
Persons &j A&A AIVAY vi continuum T 0 0 
II 1 ! 
II Zz 1 
IV 2 4 
Vv 3 3 
4 Has Obtained re VI 4 4 
; 0 | 2 3 4 score score OT 234 


FicuRE 1. The relation between the attribute continuum, true score and obtained score. 
Persons are shown as having point locations (triangles below the top line) and items 
as probabilistic locations (probability density distributions above the top line) along 
a directed attribute continuum. True score and obtained score are shown as 
independently derived from the attribute locations of persons and items. True scores 
are determined from medians of item locations, a person passing an item if his loca- 
tion is to the right of its median location. Obtained scores are determined from the 
probability density distribution. ‘The probability of a person’s passing an item is 
the probability under the density distribution for the item to the left of the location 
of the person. Some possible discrepancies between true score and error score 


are shown in the table inset. 


Ount to assuming that there is an indeterminacy in the location of items which 
not the case for people. Since itis assumed that a person passes an item at any 
sting if his location is above the location drawn from the item distribution, 
tained score is thus undefined except in probabilistic terms. ‘True score is 
defined at any given instant, but may vary over time. The question of attribute 
Stability is thus made quite distinct from that of attribute measurement. It 
Might be noted that it niakes no essential difference to locate the momentary 
‘Mdeterminacy in the pérsons, or to spread it over both persons and items. hl It 
as been chosen to locate the indeterminacy in the item in order to show that it is 
Possible to construct a coherent theory on the assumption that persons have a 
Xed location, which is equivalent to assuming that they have a definite amount 
of the attribute. | 
Where true score is defined in terms of the momentarily fixed person location 
And the median of the momentary distribution of item locations, obtained item 
Score is defined in terms of the fixed person location and a random location from 
€ item distribution. True score and obtained item Score are defined inde- 
Pendently in terms of the attribute and the characteristics of items and persons. 
tained score is not defined directly as true score plus error. This is a fact 
ich follows frongthe definition of true score and obtained score. There are 
in Sequences of hnsiderable importance when we come to consider variation 
ftror over the true score range. , f 
tue score is defined in terms of a fixed set of items, but a person's location 
© attribute is independent of any set of items which might be ev ied 
Us as item sets vary, true scores will vary, and one can ask about any set 0 


ks 


On ft 


John Ross and James Lumsden 
258 


irs of 
ডী Pairs 0 
i ntinuum. is 
্ sents the attribute co 3 foogtiol! 
a well their true score repre in median: 
ৰ ag Consist of items which are matched oe A REE than 
i i hi reater dispe ie 
Se f the items may show g Lktheobtal 
but the distributions o jel CEs re en Oa ঢ 
8 irs of sets would yield the sam le dere 
inthe other. Such pairs he OB i. 
Ty the true score more in the na thesia 
score would vary about by defining ne 
i test we may do so by Eo de 
h to define parallel forms of a e may ld als 
which yield the same true score for all individuals. EE | rE atched for 
matched forms of a test, which are both parallel, and : mn : a ade, it i5 
dispersion as well as median location. Once these distincti E 


WwW 
iability: (1) Ho 
clear that there are three questions in the area usually called reliability 

well does the true score represe 


gine 
e obtai 
nt the attribute? (2) How well does th 

Score represent the true score? 


t 
Rt represen 
(3) How well does the obtained score rep 

the attribute? The last is the central question. 


Other types of test 


numbers, 
ference that Properties of Obje nD adequately be represented by th 
and that others do not. That the inference can be drawn can be ঠ a function 
Proof of a representation mM, which establishes that there is procedu 
mapping from the empirical Structure to the real numbers. Ed 
for which the existence of a mapping function could be establishe 

test in our Sense. 

Most case 


ison 
ris’ 
a . . i comp? 
s studied by Axlomatic means have employed the direct 
of Objects one with another, as 
for preference 


[2-5 


ndirect comparison. 
Ment of the model 
Proper test has b 


een undertaken, it s 
could be develo ed i 
any given attribute. 
the requirement that ii 
sample of the items, then 4 


on 07 
han B and B higher than " 0 
sufficient size. Diffe 


]e5 d 
dice 
reju 5 
ট d provide indirect but be 0 ne 
ro. Simi transitivity of scaled score in differe 
m the empirical differences in scaled scores 


3. THs Locistic Vv 


EL ip { 
0 RIBUTE- BASED ITEM pe 
In Bimmbaum’s (957) i € item Characteristic curves ope b 
form. The logistic function, like the normal, is completely spect 


Attribute and Reliability 259 


Hones one specifying location, the other dispersion. Birnbaum shows that 
যয পর শপ যশ লাপত [ 31-7" তল 
Assuming one knows the distribution of tdatols en ihe etebute, 4 £ 
L li est 
Which meets the Birnbaum model formally satisfies the requirements of a 
testing device as we have developed it, and it is possible to estimate the para- 
Meters needed to apply the model. A strong test of the model is possible for any 
Biven test. It is possible to derive a predicted distribution of a weighted test 
Score from the item parameters alone. Ross (1966) has recently shown that 
the model fits some six tests varying in content, item format, item difficulty and 
Average inter-item correlation. 
From the same item parameters which yield the predicted score distribution 
One can also compute a regression of test score on the attribute and the variance 


Of test score about the regression as a function of the attribute. 


An example of the application 
Birnbaum’s model assumes that there is a single ability involved, Y, and that 
the Probability, py, of passing item g is given by the logistic function. The basic 
equation in the model is thus 
b= 11 +exp(by — agy)), 

Where ap is a dispersion parameter depending upon the correlation between 
tem g and the attribute, and by is a location parameter. The definitions of ag 
And by are 
ag=CroV(l-rgf),  bi=N-Apa)V(C + as), 
Where C is a constant (approx. 1:7), 7 is the item-attribute correlation, and 

“1(p,) is the normal deviate corresponding to the proportion of the population 


Passing the item. 

1 With these definitions, 
957; Ross, 1966), Birnbaum develops a weighted te 

AS the sum of the items passed, each weighted by ag, 
Above. Birnbaum then develops three propositions which are central to our 


development: (1) that the statistic tis a sufficient statistic, i.e. it preserves all the 
Ormation in the test protocol relevant to the estimation of the value of Y; 
2) that there is a simple equation, using ag and by, which gives the expected 
ue of t for any value y of Y; (3) that there is another simple equation, again 
Using ag and by, which gives the asymptotic variance of t for any value y of Y. 
i ‘The direct implication is that if the model fits, a maximally efficient yi 
Hr dure can be set up for estimating the attribute, not from observed score but 
io the weighted score t, which is the sufficient statistic. Furthermore, the 
্ del is self-protective in that if a distribution of ability is assumed, and if ag 
of + by are estimated for each item, it can then be estimated what bee EE 
th Scores should be. If they are as predicted, there is solid supporting eviden ন 
At the model fits and that the test is a test in the general sense of a device whic 
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Once obvious is that both the regression function and the variance function are 
Non-linear, which makes it impossible to reduce the relation between the statistic 
and the attribute to any single index. The average variance could be computed 
Or an average weighted variance according to the attribute distribution but 
this would not be helpful. The fit between the expected and obtained distribiis 
tions encourages the belief that the model is close to being realized in the two 
tests studied. Even better fits were obtained with four other tests which gave 


extreme U-shaped distributions. 


Implications for reliability 

The reliability problem, as stated, is to determine the extent to which 
estimates given by a test might be in error. For attribute-based itemized tests, 
Which map from attribute to test space, the problem is to describe precisely the 
error or variability in the mapping. Birnbaum’s model first describes the 
mapping by way of a regression of test score on the attribute (see Fig. 3), and 
secondly describes the error in the mapping by giving the variance of test score 
about the regression as a function of the attribute. This is a complete answer 
to the reliability problem as it was posed. ‘The answer is not in the form of an 
index, but of two functions which together provide the required description of 
the test mapping. Indices comparable to reliability coefficients might be 
derived, but they would be incomplete descriptions in all but very special cases. 
An answer in similar form is possible in principle for tests of all types, not only 


for itemized tests. 


4. THE FALSE TRAIL OF TRUE SCORE 


In his 1952 monograph Lord scarcely mentioned true score, nor needed to 
for his development of test theory. In 1959, however, he called true score the 
fundamental concept, and treated it as a genotype or ideal, an obtained score 
Stripped of error. The shift in Lord’s position has been accompanied by a 
Concern to find models (like the item-sampling models) which will fit testing 


Situations which do not satisfy the strict demands of the 1952 model. In cases 
peak of an attribute, it is tempting to set up 


But, as argued above, true score needs to be 


efined from an attribute and will not stand, by itself, as an interior property of 
the test, The outcome of the attempt to set up true score as an ideal is, in point 
of fact, the position reached by Cronbach et al., which attributes to the test as 
any true scores as universes which may be found for it. In brief then, while 
tue score may suffice as a conceptual tool with which to examine the exterior 
toperties of a test (its relations to other tests and criteria) it will not suffice for 
Xamining the intrinsic and singular interior properties of “the test. The hard 
th, as insisted earlier in this paper, is that there do exist tests in the usual 
nse which cannot be handled as tests in the psychometric sense. Some tests 
Ay be separable into their unitary components, as a mixture is separable into 


sete it makes no good sense to § 
tue score as an alternative ideal. 
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of ron considered as noise, constitutes a further source of information loss, but 
১ ifferent kind. One may speak of two tests with a similar regression of 
ribute on statistic as informationally similar, in the sense that both are bound 
by the same informational limit, and regard the test with the greater conditional 
Variance about the regression as differing in the second source of information 
loss, error. It is probably this distinction which lies at the heart of the true score 
concept which leads to the so-called attenuation paradox. The distinction 
seems to serve no practical purpose in the case of strict measurement, i.e. location 
along an attribute. It is, however, conceivable that one may wish to map into 
other sorts of spaces (which is not strictly measurement) and that it will be helpful 
to remember that the test space and the attribute space must correspond in 
Certain ways, as well as that information loss and error must be low if the test 


is to give the mapping required. 
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NOTES AND CORRESPONDENCE 


A NOTE ON AN EXCEPTION TO THE RULE OF DECREASING 
EIGENVALUES IN SUCCESSIVE FACTOR ITERATIVE METHODS 


By H. F. PRIssT 
University of Canterbury, Christchurch, New Zealand 


In basic structure successive factor methods it is assumed that eigenvalues will be found 
in descending order of magnitude. However, this is conditional not only on a sufficient 
number of iterations being carried out, but also on there being no vector in the matrix 
which is orthogonal or nearly orthogonal to the trial vector. An actual case is quoted. 


As larger computers are brought into use, successive factor iterative methods may 
become obsolete. However, as of now there are plenty of smaller machines in use and, 
at least for didactic purposes, the method may be with us for a long time yet. 

In basic structure successive factor methods, it is customary to set up a first trial 
Vector which has all values at unity. It is then assumed that successive eigenvectors 
will have eigenvalues in decreasing order of magnitude. The ‘root one’ criterion for 
stopping the extraction,of factors—widely used despite objections (Horn, 1966)—depends 
On the truth of this assumption. So, of course, does any other such criterion. We are 
Warned that this assumption is conditional, e.g. ‘ This is true in the exact mathematical 
Sense only if the iterations for each vector have been carried far enough ’ (Horst, 1965, 
P. 158). ‘The assumption, however, carries with it another condition. The other 
condition is that there is no vector in the matrix which is orthogonal or very nearly 
Orthogonal to the trial vector. When this second condition is violated the eigenvalues 
Will not necessarily be produced in descending order of magnitude. 

. A particular matrix of correlations (which arose from a matrix of real data collected 
In a clinical setting) is as in Table 1. 


TABLE 1. CORRELATIONS 


58 
-11 -25 
76 85 —-23 


75 66 -27 85 
67 88 -17 65 61 

87 87 -22 9% 80 176 

= =25 11 -23, -27 -17 =22 


OAIAuPSL- 


‘The curious relation of variables 3 and 8 is the point of interest here. 

Horst’s program 7.4 (1965, p. 608) ‘ Solution Without Residuals ? is used to demon- 
Strate the point. If the above matrix is factored with this program (making only those 
mend nts necessary for it to run on an IBM 1620) it produces two vectors with 
lgenygjyes 4:98 and 0:820. Having found an eigenvalue less than one, it stops. If 

€ program is then amended to extract further vectors, it produces eigenvalues 4-98, 
0.820, 1.111, 0-570, 0-299, etc. in that order. If the program is again amended, so that 
1t sets upa first trial -yeetor A(, L) =1 (effectively 1, 2, 3, 4, etc.), then it produces the 
Same Vectors and eigenvalues, but this time in descending order of magnitude of 


e 
‘Senvalues. 
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ণ্চ ix 1 ‘Table 2 wa: 
‘0 show that number of iterations is not relevant here, the matrix ir oi el 
duced using Horst’s program 7.4 (0p. cit.), with a variable stop amendm: he un 
pro 


ন 40; ions and eight 
trial vector, a comparison value (P) of 0:00000001, allowing up to 40 iterations 
decimal places. 


TABLE 2. EIGENVECTORS, EIGENVALUES AND ITERATIONS 
1 2 3 4 

0-845 0-282 0:000 = 0:292 

0-901 = 0-045 0-000 0-386 

=0:275 0-595 = 0-745 0-021 

0-942 0-010 0-000 - 0-144 

0.873  _0.-081 0-000 —0:323 

0-836 0-119 0:000 0-454 

0-975 0-093 0-000 — 0-052 

0-275 0-595 0-745 0-021 

Iterations 6 14 26 16 
Latent roots 4-983 0-820 1-111 0-570 
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BOOK REVIEW 


Timing in Temporal Tracking. By J. A. MicHOoN. Soesterberg: Institute for 

Perception, RVO-TNO, 1967. Pp. 127. 

In this monograph Michon discusses a series of experiments and develops a model 
accounting for the accuracy with which subjects can tap either at a given rate or in 
synchrony with a train of clicks. As such it would appear to be only relevant to 
psychologists whose research interests already include this narrow area. This would, 
however, be a very superficial assessment, for the monograph deserves a much wider 
readership. 

Michon sees his experimental paradigm as a microcosm in which mechanisms 
important in a wide range of behaviours can be more easily studied in detail; in particular 
those mediating the exact timing of actions, and which are now considered essential in 
both psycholinguistics and motor skills. ‘The relation between this experimental 
microcosm and more complex behaviours is clearly discussed, a welcome change from 
the common strategy of some psychologists, of apparently taking for granted the value of 
studying a certain microcosm, simply because other psychologists have studied it. 

One might expect the author, in traditional fashion, simply to state his model of how 
the subject behaves in this experimental situation, then to discuss whether or not the 
experimental results differ significantly from his theoretical predictions, and finally to 
speculate on why the differences, if any, occur. Instead, the author’s approach to theory 
construction is much more subtle. He begins with what seems to him to be the simplest 
conceivable model which could account for these elementary tapping tasks. ‘Then, by a 
sort of dialectical process, he first elaborates the model slightly in a plausible way, so that 
it simulates more closely human performance, then complicates the experimental situation 
in order to test this new model, and so on. An example of the elaborations he introduces 
into the model is to assume that the memory of the time interval to be reproduced by the 
Subject itself fades with time, whilst one of the experimental complications he uses is to 
Vary systematically the inter-click intervals, so as to check the assumption that only the 
previous inter-click interval and the error in responding to it are used for predicting the 
next inter-tap interval. An additional task is devised in order to investigate whether, as 
would be expected, only particular parameters of the model are affected by divided 
attention. Throughout the development of the model, Michon uses both information 
Processing and linear systems analysis, with the sequence of inter-click intervals being 
considered as input, and the sequence of inter-tap intervals as output. ‘The model is 
finally presented as a computer program in order to demonstrate its internal consistency. 

‘The one rather striking set of empirical results Michon has not yet incorporated 
into his model concerns the existence of temporal periodicities in the inter-tap interval 
distributions. ‘To complicate matters, these periodicities vary with changes in the 
situation, but have a median frequency of about 12 cycles per second, a value which 
resembles some found in perception, such as the maximum subjective rate of perceived 
flicker, Perhaps this phenomenon is correlated to the findings that responses are made 
more frequently in one part of the motor cortex alpha cycle than during the rest of the 
Cycle, 

The author develops the mathemati 
that the monograph should prove compre. 


cal techniques he uses from first principles, so 
hensible to most experimental psychologists. 
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